close

[李東昇的意見]全世界的 6912 種 語言 的 統一編碼

(ISO 639-3 不只包含 6912種 現存的語言

還包含了 已滅絕的語言(例如 古埃及文)

還包含了 人造的語言(例如 世界語 或 魔戒作者發明的 精靈語 等)

共計 7590 個 代碼!

資料太多,請自行下載

請參看這個網站:

http://www.ethnologue.com/ethno_docs/distribution.asp?by=area

會讓你對全世界語言的現況,有一個簡單的概念

如果你對 英文排斥,就請耐心看我的部落格,我會做一個完整介紹

===================

資料來源/下載網頁:

http://www.sil.org/iso639-3/download.asp

  


ISO 639-3 Downloads


This page offers links to downloadable versions of the complete ISO 639-3 code set, a language names index, the mapping of macrolanguages to individual languages, and the mapping of retired code elements to current code elements. The Code Set table and the Language Names Index table are formatted as tab-delimited, UTF-8 text files. These tables are also offered as ISO/IEC 8859-1 encoded text files, though these simplified encodings should not be considered normative. The remaining two files map between identifiers; they are presently encoded in ISO/IEC 8859-1, as currently all data in these tables can be correctly encoded in the more limited character set. The first line of each file contains the column names rather than the first row of data.

=========================================

ISO 639-3 Code Set 
代碼序


NOTE: The "Ref_Name" column in this table contains a reference name by which this language is identified in the standard. The Reference Name is offered for ease of use of the code set, and does not imply it is to be preferred in any application to any other name that may be associated with the particular language as given in the Language Names Index. However, we do wish to avoid use of any pejorative name as the Reference Name. If you are aware of any instance in which a code element uses a pejorative or derogatory name as the Reference Name, please bring this to our attention at the contact email address below.

The complete code table of active code elements may be downloaded by clicking the following link.

Download ISO 639-3 code set UTF-8
(點選左邊下載)


For users with systems unable to utilize Unicode character encoding, the code set table is also offered in a simplified version in Latin-1 (ISO/IEC 8859-1).

The following declaration is a sample formal definition for a SQL database table into which the tab-delimited file can be loaded:

CREATE TABLE ISO_639-3 (
Id char(3) NOT NULL, -- The three-letter 639-3 identifier
Part2B char(3) NULL, -- Equivalent 639-2 identifier of the bibliographic applications code set, if there is one
Part2T char(3) NULL, -- Equivalent 639-2 identifier of the terminology applications code set, if there is one
Part1 char(2) NULL, -- Equivalent 639-1 identifier, if there is one
Scope char(1) NOT NULL, -- I(ndividual), M(acrolanguage), S(pecial)
Type char(1) NOT NULL, -- A(ncient), C(onstructed),
-- E(xtinct), H(istorical), L(iving), S(pecial)
Ref_Name varchar(150) NOT NULL) -- Reference language name

============================================

Language Names Index 語言名稱序


In ISO 639-2, there are multiple name forms for some identified languages. The ISO 639-3 code tables now include a language name index with multiple names associated many code elements (primarily in English forms or variant anglicized spellings of indigenous names). The reference name from the Ref_Name field of the main table is included as an entry in this table, thus every code element has at least one row occurrence in the Language Names Index table. The name appears in two forms, a "print" form used in most contexts, and an inverted form which fronts a language name root, e.g., "Isthmus Zapotec" and "Zapotec, Isthmus". Where there is no root part to the name, the Print_Name and the Inverted_Name contain identical strings. The Language Names Index may be downloaded by clicking the following link

Download ISO 639-3 Language Names Index UTF-8  (點選左邊下載)


For users with systems unable to utilize Unicode character encoding, the Language Names Index table is also offered in a simplified version in Latin-1 (ISO/IEC 8859-1).

The following declaration is a sample formal definition for a SQL database table into which the tab-delimited file can be loaded:

CREATE TABLE ISO_639-3_Names (
Id char(3) NOT NULL, -- The three-letter 639-3 identifier
Print_Name varchar(75) NOT NULL, -- One of the names associated with this identifier
Inverted_Name varchar(75) NOT NULL) -- The inverted form of this Print_Name form

==========================================

Macrolanguage Mappings 語支


The complete set of mappings from macrolanguages to the individual languages that comprise them may be downloaded by clicking the following link.

Download ISO 639-3 macrolanguage mappings (點選左邊下載)


The table has only two columns; the first identifies a macrolanguage and the second identifies one of its members. Thus a given macrolanguage has as many rows as it has individual languages that are its members. The following declaration is a sample formal definition for a SQL database table into which the tab-delimited file can be loaded:

CREATE TABLE ISO_639-3_Macrolanguages (
M_Id char(3) NOT NULL, -- The identifier for a macrolanguage
I_Id char(3) NOT NULL) -- The identifier for an individual language
-- that is a member of the macrolanguage

=========================================

Retired Code Element Mappings 過期代碼


The annual update to the 639-3 code set will include a complete listing of the code elements that have been retired with instructions on how to update existing data. Since the initial release of ISO/FDIS 639-3 and prior to the release of ISO 639-3, there was one retirement, a correction to the alignment between ISO 639-3 and ISO 639-2. It is included in the Retired Code Element Mappings because it has been a source of confusion for users. The Retired Code Element Mappings table may be downloaded by clicking the following link.

Download ISO 639-3 code retirement mappings (點選左邊下載)


The table has five columns; the first has the affected identifier, the second has a coded value for the reason the retirement was necessary, the third contains a single identifier if the retired identifier maps unambiguously to another identifier, the fourth contains a prose statement about what should be done to update a code element split, and the fifth gives the date the change was made effective. The following declaration is a sample formal definition for a SQL database table into which the tab-delimited file can be loaded:

CREATE TABLE ISO_639-3_Retirements (
Id char(3) NOT NULL, -- The three-letter 639-3 identifier
Ret_Reason char(1) NOT NULL, -- code for retirement: C (change), D (duplicate),
-- N (non-existent), S (split), M (merge)
Change_To char(3) NULL, -- in the cases of C, D, and M, the identifier
-- to which all instances of this Id should be changed
Ret_Remedy varchar(200) NOT NULL, -- The instructions for updating an instance
-- of the retired (split) identifier
Effective date NOT NULL) -- The date the retirement became effective

arrow
arrow
    全站熱搜

    alexandroslee 發表在 痞客邦 留言(0) 人氣()