tz database
The tz database, also called the zoneinfo database or IANA Time Zone Database, is a collaborative compilation of information about the world's time zones, primarily intended for use with computer programs and operating systems.[1] It is sometimes referred to as the Olson database after the founding contributor Arthur David Olson.[2] Paul Eggert is editor and maintainer of the tz database.[3]
Its most recognizable feature is the uniform naming convention designed by Paul Eggert for time zones, such as "America/New_York" and "Europe/Paris" (see List of tz database time zones).[4] The database attempts to record historical time zones and all civil changes since 1970, the Unix time epoch.[5] It also includes transitions such as daylight saving time, and even records leap seconds.[6]
The tz database partitions the world into regions where local clocks have all been the same since 1970. This map, taken from the 2009r edition of the database, is of all the regions outside Antarctica.[7]
History
The project's origins go back to at least 1986.[8] The project's database, as well as some reference source code, is in the public domain.[9] New editions of the database are published as changes warrant, usually several times per year.[10]
2011 lawsuit
On September 30, 2011, a lawsuit, Astrolabe, Inc. v. Olson et al.,[11][12] was filed concerning copyright in the database. As a result, on October 6, 2011, the database's maintenance (mailing list) and dissemination (FTP site) operations were shut down.[13] The case revolves around the use by the database maintainers of the atlases The American Atlas by Thomas G. Shanks and The International Atlas by Thomas G. Shanks and Rique Pottenger. It specifically complains of unauthorised reproduction of the atlases' data in the timezone mailing list archive and in some auxiliary link collections maintained with the database, but doesn't actually point at the database itself. The complaint relates only to the compilation of historical timezone data, and does not cover current tzdata world timezone tables.[14][12][15] The tz database clearly references its sources, including the atlas, in comments, allowing the extent of use of the data to be evaluated.[16][17]
Move to ICANN
ICANN took responsibility for the maintenance of the database on October 14, 2011.[18] The full database (including allegedly-infringing sections) and a description of current and future plans for its maintenance are available online from IANA.[19]
Data structure
File formats
The tz database is published as a set of text files which list the rules and zone transitions in a human-readable format. For use, these text files are compiled into a set of platform-independent binary files, one per time zone. The reference source code includes such a compiler called zic (zone information compiler), as well as code to read those files and use them in standard APIs such as localtime() and mktime().
Definition of a time zone
Within the tz database, a time zone is any national region where local clocks have all agreed since 1970.[20] This definition concerns itself first with geographic areas which have had consistent local clocks. This is different from other definitions which concern themselves with consistent offsets from a prime meridian. Therefore each of the time zones defined by the tz database may document multiple offsets to UTC, typically containing both the standard time and the daylight saving time in the same zone.
Names of time zones
The time zones have unique names in the form "Area/Location", e.g. "America/New_York", in an attempt to make them easier for humans to understand. A choice was also made to use English names or equivalents, and to omit punctuation and common suffixes. The underscore character is used in place of spaces. Hyphens are used where they appear in the name of a location.
Area
Area is the name of a continent, the name of an ocean, or "Etc". The set of continents and oceans currently includes: Africa, America, Antarctica, Arctic, Asia, Atlantic, Australia, Europe, Indian, and Pacific.
The special area Etc is used for some administrative zones, particularly "Etc/UTC", which represents Coordinated Universal Time. To conform with the POSIX style, zone names beginning with "Etc/GMT" have their sign reversed from what most people expect: zones west of GMT have a positive sign in their name and those east have a negative sign (e.g., "Etc/GMT-14" is 14 hours ahead of, or east of, GMT).
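Because the reversed sign is easy to misread, a minimal sketch of the rule may help. The helper name etc_gmt_utc_offset_hours is hypothetical and not part of the tz code; it inspects only the zone name and consults no tz data.

```python
import re


def etc_gmt_utc_offset_hours(name: str) -> int:
    """Return the conventional UTC offset (hours east of Greenwich)
    implied by an "Etc/GMT+N" or "Etc/GMT-N" zone name.

    Hypothetical helper for illustration; it only parses the name.
    """
    match = re.fullmatch(r"Etc/GMT([+-])(\d{1,2})", name)
    if match is None:
        raise ValueError(f"not an Etc/GMT offset zone name: {name!r}")
    sign, hours = match.group(1), int(match.group(2))
    # POSIX style: "+" in the name means west of Greenwich (behind GMT),
    # so the conventional offset carries the opposite sign.
    return -hours if sign == "+" else hours


print(etc_gmt_utc_offset_hours("Etc/GMT-14"))  # 14, i.e. 14 hours ahead of GMT
print(etc_gmt_utc_offset_hours("Etc/GMT+5"))   # -5, i.e. 5 hours behind GMT
```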
Location
Location is the name of a specific location within that region, usually a city or a small island.
Country names are not used in this scheme, primarily because they would not be robust due to frequent political and boundary changes. The names of large cities tend to be more permanent. However, the database maintainers attempt to include at least one zone for every ISO 3166-1 alpha-2 country code, and a number of user interfaces to the database take advantage of this. Additionally there is a desire to keep locations geographically compact so that any future time zone changes do not split locations into different time zones.[citation needed]
Usually the most populous city in a region is chosen to represent the entire time zone, although other cities may be selected if they are more widely known or result in a less ambiguous name.[citation needed] In the event that the name of a city changes, the convention is to create an alias[citation needed] in future editions so that both the old and new names refer to the same database entry.
In some cases the Location is itself represented as a compound name, for example the time zone "America/Indiana/Indianapolis". Three-level names currently exist only under "America/Argentina/...", "America/Kentucky/...", "America/Indiana/...", and "America/North_Dakota/...".
The location selected is representative for the entire area.
On 2010-05-01, Arthur David Olson cited a 14-character limit[21] to justify dropping "de" from the name of Bahia de Banderas and using only "Bahia_Banderas" in the identifier America/Bahia_Banderas.
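The length limit can be checked mechanically. The sketch below assumes the limit applies to each "/"-separated component of a name rather than to the whole name, which is consistent with three-level names such as America/Indiana/Indianapolis. The function name overlong_components is hypothetical.

```python
MAX_COMPONENT_LENGTH = 14  # the limit cited on the tz mailing list


def overlong_components(zone_name: str) -> list[str]:
    """Return the '/'-separated components that exceed the limit.

    Assumption: the limit is per component, not per full name.
    """
    return [part for part in zone_name.split("/")
            if len(part) > MAX_COMPONENT_LENGTH]


for name in ("America/Bahia_Banderas",        # the identifier actually used
             "America/Bahia_de_Banderas",     # the rejected full spelling
             "America/Indiana/Indianapolis"):
    print(name, overlong_components(name))
```

Only the rejected spelling reports an overlong component ("Bahia_de_Banderas", 17 characters).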
Examples
- America/Costa_Rica: name of country used because the name of the largest city, San José, is ambiguous
- America/New_York: space replaced with underscore
- Asia/Sakhalin: name of island used, because the largest city, Yuzhno-Sakhalinsk, has more than 14 characters
- America/Bahia_Banderas: name of largest city altered, "de" removed from Bahia de Banderas, because the correct name has more than 14 characters
- Antarctica/DumontDUrville: the apostrophe is removed. Removing the space does not conform to the rule that requires replacement with "_", but with "_" the name would have 15 characters
Data stored for each zone
For each time zone that has multiple offsets, usually the standard and daylight variants, the tz database records the exact moment of transition. The format can accommodate changes in the dates and times of transitions as well.
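One way to picture this data is as a sorted list of transition instants, each paired with the offset that takes effect from that instant. The sketch below is an in-memory illustration only, not the on-disk format or the reference implementation. The two 2011 entries follow the EU rule (last Sunday of March and of October, at 01:00 UTC) as it applies to Europe/Berlin; a real zone carries many more entries. The names TRANSITIONS, INITIAL and local_offset are hypothetical.

```python
from bisect import bisect_right
from datetime import datetime, timedelta, timezone

UTC = timezone.utc

# (transition instant in UTC, offset in force from then on, abbreviation)
TRANSITIONS = [
    (datetime(2011, 3, 27, 1, 0, tzinfo=UTC), timedelta(hours=2), "CEST"),
    (datetime(2011, 10, 30, 1, 0, tzinfo=UTC), timedelta(hours=1), "CET"),
]
# Offset in force before the first listed transition.
INITIAL = (timedelta(hours=1), "CET")


def local_offset(instant_utc: datetime):
    """Return (offset, abbreviation) in force at the given UTC instant."""
    times = [moment for moment, _, _ in TRANSITIONS]
    # Number of transitions that happened at or before the instant.
    index = bisect_right(times, instant_utc)
    if index == 0:
        return INITIAL
    _, offset, abbreviation = TRANSITIONS[index - 1]
    return offset, abbreviation


summer = datetime(2011, 7, 1, 12, 0, tzinfo=UTC)
offset, abbreviation = local_offset(summer)
print(summer + offset, abbreviation)
```

A binary search keeps the lookup cheap even for zones with a long history of rule changes.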
Zone.tab
The file zone.tab is in the public domain and lists the zones. Columns and row sorting are described in the comments of the file, as follows:
# This file contains a table with the following columns:
# 1. ISO 3166 2-character country code. See the file `iso3166.tab'.
# 2. Latitude and longitude of the zone's principal location
#    in ISO 6709 sign-degrees-minutes-seconds format,
#    either +-DDMM+-DDDMM or +-DDMMSS+-DDDMMSS,
#    first latitude (+ is north), then longitude (+ is east).
# 3. Zone name used in value of TZ environment variable.
# 4. Comments; present if and only if the country has multiple rows.
#
# Columns are separated by a single tab.
# The table is sorted first by country, then an order within the country that
# (1) makes some geographical sense, and
# (2) puts the most populous zones first, where that does not contradict (1).
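The column description above is enough to sketch a parser for a single row. The names ZoneTabRow and parse_zone_tab_line are hypothetical, and the sample row is written in the documented format for illustration rather than quoted from a particular edition of the file.

```python
import re
from typing import NamedTuple, Optional


class ZoneTabRow(NamedTuple):
    country_code: str
    latitude: float    # decimal degrees, north positive
    longitude: float   # decimal degrees, east positive
    zone_name: str
    comment: Optional[str]


# Matches +-DDMM+-DDDMM or +-DDMMSS+-DDDMMSS (seconds are optional).
_COORDS = re.compile(
    r"([+-])(\d{2})(\d{2})(\d{2})?([+-])(\d{3})(\d{2})(\d{2})?")


def _to_degrees(sign, degrees, minutes, seconds):
    value = int(degrees) + int(minutes) / 60 + int(seconds or 0) / 3600
    return -value if sign == "-" else value


def parse_zone_tab_line(line: str) -> Optional[ZoneTabRow]:
    """Parse one line; return None for comment lines and blank lines."""
    line = line.rstrip("\n")
    if not line or line.startswith("#"):
        return None
    fields = line.split("\t")  # columns are separated by a single tab
    match = _COORDS.fullmatch(fields[1])
    if match is None:
        raise ValueError(f"bad coordinates: {fields[1]!r}")
    latitude = _to_degrees(*match.group(1, 2, 3, 4))
    longitude = _to_degrees(*match.group(5, 6, 7, 8))
    comment = fields[3] if len(fields) > 3 else None
    return ZoneTabRow(fields[0], latitude, longitude, fields[2], comment)


# Illustrative row in the documented format.
sample = "US\t+404251-0740023\tAmerica/New_York\tEastern Time"
print(parse_zone_tab_line(sample))
```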
Data before 1970
Data before 1970 aims to be correct for the city identifying the region, but is not necessarily correct for the entire region. This is because new regions are created only as required to distinguish clocks since 1970.
For example, between 1963-10-23 and 1963-12-09 in Brazil only the states of Minas Gerais, Espirito Santo, Rio de Janeiro and São Paulo had summer time, but on request a split from America/Sao_Paulo was rejected with the reasoning that since 1970 the clocks were the same in the whole region.[22]
Time in Germany, which is represented by Europe/Berlin, is not correct for the year 1945 when the Trizone used different daylight saving time rules than Berlin.[citation needed]
Coverage
Zones covering multiple post-1970 countries
Two zones each cover an area that was divided between two countries after 1970. The tzdata follows the definitions of countries as per ISO 3166-1, whose predecessor ISO 3166 was first published in 1974.
- Asia/Aden - two countries until 1990: North Yemen (Sanaa, ISO 3166-1: YE) and South Yemen (Aden, People's Republic, ISO 3166-1: YD, ISO 3166-3: YDYE).
- Europe/Berlin - two countries until 1990: East Germany (ISO 3166-1: DD, ISO 3166-3: DDDE) and West Germany (ISO 3166-1: DE)
Maintenance
The tz reference code and database are maintained by a group of volunteers. Arthur David Olson makes most of the changes to the code, and Paul Eggert to the database. Proposed changes are sent to the tz mailing list, which is gatewayed to the comp.time.tz Usenet newsgroup. Source files are distributed via the FTP server elsie.nci.nih.gov. Typically, these files are taken by a software distributor like Debian, compiled, and then the source and binaries are packaged as part of that distribution. End users can either rely on their software distribution's update procedures, which may entail some delay, or obtain the source directly from ftp://elsie.nci.nih.gov/pub/ and build the binary files themselves. A future maintenance plan based on similar principles has been drafted through the IETF. Since 14 October 2011, IANA has acted as a repository for the TZ database and associated reference code.[23]
Pursuant to a lawsuit,[12] the FTP server and mailing list have been shut down.[13]
Unix-like systems
The standard path for the timezone database is /usr/share/zoneinfo/ on most Unix-like systems, including Linux distributions. The file command has built-in support for summarizing the binary timezone files in human-readable form:
$ file /usr/share/zoneinfo/Europe/Berlin
/usr/share/zoneinfo/Europe/Berlin: timezone data, version 2, 8 gmt time flags, 8 std time flags, no leap seconds, 144 transition times, 8 abbreviation chars
Usage and extensions
Boundaries of time zones
Geographical boundaries in the form of coordinate sets are not part of the tz database, but boundaries are published by Eric Muller[7] in the form of vector polygons. Using these vector polygons, one can determine, for each place on the globe, the tzdata zone in which it is located.
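Such a lookup amounts to a point-in-polygon test. The sketch below uses the standard ray-casting method on planar coordinates. It is not taken from any published tool, it ignores complications such as the antimeridian and the poles, and the two rectangular "zones" are hypothetical placeholders rather than real boundary data. Reading an actual shapefile would require a separate library.

```python
def point_in_polygon(lon, lat, polygon):
    """Ray-casting test: True if (lon, lat) lies inside the polygon.

    polygon is a list of (lon, lat) vertices; the last vertex is
    implicitly joined back to the first.
    """
    inside = False
    count = len(polygon)
    for i in range(count):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % count]
        # Does this edge cross the horizontal ray running east from the point?
        if (y1 > lat) != (y2 > lat):
            x_cross = x1 + (lat - y1) * (x2 - x1) / (y2 - y1)
            if lon < x_cross:
                inside = not inside
    return inside


def find_zone(lon, lat, zone_polygons):
    """Return the name of the first zone whose polygon contains the point."""
    for name, polygon in zone_polygons.items():
        if point_in_polygon(lon, lat, polygon):
            return name
    return None


# Hypothetical rectangles for illustration only, NOT real zone boundaries.
ZONES = {
    "Example/West": [(-10, 0), (0, 0), (0, 10), (-10, 10)],
    "Example/East": [(0, 0), (10, 0), (10, 10), (0, 10)],
}

print(find_zone(-5, 5, ZONES))
print(find_zone(5, 5, ZONES))
```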
Use in other standards
CLDR uses UN/LOCODEs to identify regions.[24] This means that every identifier references a country, something that the creators of the tz database wanted to avoid.
Use in software systems
The tz database is used for time zone processing and conversions in many computer software systems, including:
- BSD-derived systems, including FreeBSD, NetBSD, OpenBSD, DragonFly BSD, and Mac OS X;
- the GNU C Library and systems that use it, including GNU, most Linux distributions, BeOS, Haiku, Nexenta OS, and Cygwin;
- System V Release 4-derived systems, such as Solaris and UnixWare;
- AIX 6.1 and later[25][26] (earlier versions of AIX, starting with AIX 5.2, include zoneinfo,[27] for support of third-party applications such as MySQL,[28] but do not use it themselves[27][29]);
- several other Unix systems, including Tru64, and UNICOS/mp (also IRIX, still maintained but no longer shipped);
- OpenVMS;
- the Java Runtime Environment since release 1.4 (2002);
- the Perl modules DateTime::TimeZone and DateTime::LeapSecond since 2003;
- PHP releases since 5.1.0 (2005);
- the Python module pytz;
- the .NET Framework module zoneinfo;
- Oracle releases since 10g (2004);[30]
- PostgreSQL since release 8.0 (2005);
- embedded software such as the firmware used in IP clocks.
The Olson timezone IDs are also used by the Unicode Common Locale Data Repository (CLDR) and International Components for Unicode (ICU). For example, the CLDR Windows → Tzid table maps Microsoft Windows time zone IDs to the standard Olson names.[31]
See also
- List of tz database time zones
- Time zone
- Daylight saving time
References
- ^ Eggert, Paul; Olson, Arthur David (2007-11-29). "Sources for time zone and daylight saving time data". http://cs.ucla.edu/~eggert/tz/tz-link.htm. Retrieved 2007-12-03.
- ^ Olson, Arthur David (1986-12-16). "Resolved timezone issue? Other issues. New ctime manual page". tz mailing list.
- ^ Eggert, Paul (2005-01). "Re: FW: IANA time zone registration - proposal". tz mailing list. http://osdir.com/ml/time.tz/2005-01/msg00027.html.
- ^ Eggert, Paul (1993-10-20). "proposal for time zone names". tz mailing list.
- ^ Olson, Arthur David (1987-03-18). "Re: ist of issues". tz mailing list.
- ^ Devine, Bob (1988-06-02). "leap seconds; [0-60] is ok". tz mailing list.
- ^ a b Muller, Eric. A shapefile of the TZ timezones of the world [updated 2009-11-23].
- ^ Olson, Arthur David (1986-11-24). "seismo!elsie!tz ; new versions of time zone stuff". tz mailing list.
- ^ Eggert, Paul (1995-11-11). "questions and comments on http://tycho.usno.navy.mil/tzones.html". tz mailing list.
- ^ "zoneinfo tzcode and tzdata archives (FTP)". ftp://munnari.oz.au/pub/oldtz/. Retrieved 2007-10-30.
- ^ "Astrolabe, Inc. v. Olson et al". 2011-10-06. http://dockets.justia.com/docket/massachusetts/madce/1:2011cv11725/139342/. Retrieved 2011-10-06.
- ^ a b c "ASTROLABE, INC., Plaintiff, v. ARTHUR DAVID OLSON and PAUL EGGERT, Defendants.". 2011-09-30. http://web.mit.edu/jhawk/tmp/tz/gov.uscourts.mad.139342.1.0.pdf. Retrieved 2011-10-07.
- ^ a b Olson, Arthur David (2011-10-06). "Civil suit; ftp shutdown; mailing list shutdown". http://article.gmane.org/gmane.comp.time.tz/4133. Retrieved 2011-10-06.
- ^ "Time zone database shut down". The Daily Parker. 2011-10-06. http://www.thedailyparker.com/PermaLink,guid,c5f28bae-4b9c-41ea-b7b7-8891ad63c938.aspx. Retrieved 2011-10-06.
- ^ "Time-zone database - Astrolabe's opinion". Stephen Colebourne's blog. 13 October 2011. http://blog.joda.org/2011/10/time-zone-database-astrolabes-opinion.html. Retrieved 26 October 2011.
- ^ "Time-zone database down". Stephen Colebourne's blog. 6 October 2011. http://blog.joda.org/2011/10/today-time-zone-database-was-closed.html. Retrieved 26 October 2011.
- ^ "Civil Suit Filed, Involving the Time Zone Database". Slashdot. Geeknet, Inc. 2011-10-06. http://slashdot.org/story/11/10/06/1743226/.
- ^ "Time Zone Database Has New Home After Lawsuit". 2011-10-17. http://abcnews.go.com/Technology/wireStory/time-zone-database-home-lawsuit-14753833. Retrieved 2011-10-30.
- ^ "IANA -- Time Zone Database". http://www.iana.org/time-zones.
- ^ Theory (text file), contained in the "tzcode" distribution. Version tzcode2007h.tar.gz 2007-10-01 referenced.
- ^ Olson, Arthur David (2010-05-01). "proposed time zone package changes (Bahia de Banderas; version naming)". gmane.comp.time.tz. http://article.gmane.org/gmane.comp.time.tz/3223.
- ^ Olson, Arthur David (2010-01-06). "RE: little nuance in brazil 1963". tz mailing list. http://article.gmane.org/gmane.comp.time.tz/3061.
- ^ "Time zone database has new home after lawsuit". 2011-10-17. http://mm.icann.org/pipermail/tz/2011-October/008109.html. Retrieved 2011-10-17.
- ^ "Unicode Locale Extension (‘u’) for BCP 47". CLDR - Unicode Common Locale Data Repository. http://cldr.unicode.org/index/bcp47-extension.
- ^ "Olson time zone support and setup". AIX 7.1 information. IBM. http://publib.boulder.ibm.com/infocenter/aix/v7r1/index.jsp?topic=/com.ibm.aix.baseadmn/doc/baseadmndita/olson_time_zone.htm. Retrieved 2011-03-12.
- ^ "Managing the Time Zone Variable". IBM. 2007-02-02. https://www-304.ibm.com/support/docview.wss?q1=tz&dc=DB520&rs=111&uid=isg3T1000252&context=SWG10. Retrieved 2011-03-11.
- ^ a b "AIX O/S updated to support 2007 Daylight Saving Time change". IBM. 2007-10-18. http://www14.software.ibm.com/webapp/set2/sas/f/enews/2006/10/DSTupdate.html. Retrieved 2011-03-12.
- ^ "2007 daylight savings time changes for Unix". Academic Computing and Communications Center, University of Illinois at Chicago. 2007-02-25. http://www.uic.edu/depts/accc/systems/daylight/unix.html. Retrieved 2008-03-18.
- ^ Wickremasinghe, Christopher (2009-03-30). "Introduction of daylight saving time in Western Australia 2006". AIX Wiki. IBM. http://www.ibm.com/developerworks/wikis/display/usergroups/Introduction+of+Daylight+Saving+Time+in+Western+Australia++2006. Retrieved 2011-03-11.
- ^ Oracle Database Globalization Support Guide 10g Release 1 (10.1): Chapter 4, Section "Choosing a Time Zone File". Oracle Corporation. June 2004. pp. 4–14. Part No. B10749-02. http://download-uk.oracle.com/docs/cd/B14117_01/server.101/b10749/ch4datetime.htm#i1006667. Retrieved 2007-10-30.
- ^ "Windows → Tzid". Unicode Consortium. 2007-11-12. http://unicode.org/cldr/data/diff/supplemental/windows_tzid.html. Retrieved 2008-02-17.
External links
- ITU LEGAL TIME 2011
- The tz database home page (Down pending lawsuit, see IANA link below.)
- The tz mailing list archive
- "tz mailing list"; archives of these messages are available at ftp://elsie.nci.nih.gov/pub/tzarchive.gz.
- tz mailing list at ICANN
- “A literary appreciation of the Olson/Zoneinfo/tz database” by Jon Udell
- Official IANA sources
- Other sources for tzdata and tzcode
- Current tz source and data raw files - Shut down and inaccessible as of October 6, 2011 due to a civil lawsuit filed on September 30, 2011 in a federal court in Boston in which Arthur David Olson is a defendant, as per his message to the tz mailing list.
- Current tz data files compiled to multiple formats
- Lists of tz-zones by tz-group
- The tz database converted to CSV & SQL format
- Olson Database converted to Windows Standard Format
- tzcode at Apple
- tzcode at launchpad
- tzcode as part of libc in Debian
- tzdata in Debian
Wikimedia Foundation. 2010.