![]() the KANJIDIC2 file is in XML and is structured according to its DTD (Document Type Definition).where the kanji is a radical, and the radical name is not already a reading, the radical name is preceded the marker "T2".name) readings, these are preceded the marker "T1" where the kanji has special nanori (i.e.The current other classes, and their tagging, are: There may be several classes of reading fields, with ordinary readings first, followed by members of the other classes, if any. Hyphens are used to indicate prefixes/suffixes, and '.' indicates the portion of the reading that is okurigana. An exception is the set of kokuji for measurements such as centimetres, where the reading is in katakana. ON readings (音読み) are generally in katakana and KUN readings (訓読み) in hiragana. For example "S10" indicates a stroke count of 10 information fields beginning with one or two-letter codes as per the table below.the kanji itself followed by the hexadecimal form of the JIS ku-ten coding, e.g.the KANJIDIC and KANJD212 files are text files with one line per kanji and the information fields separated by spaces.The format of the distributed files as as follows: The data is described in the following table. The database and distributed data files contain an entry for each of the kanji, with each entry containing a number of fields of data about the kanji. the KANJD212 file, which also is in EUC-JP coding and covers the 5,801 kanji in JIS X 0212.the KANJIDIC file, which in in EUC-JP coding and covers the 6,355 kanji in JIS X 0208.the KANJIDIC2 file, which is in XML format and Unicode/UTF-8 coding, and contains information about all 13,108 kanji.Three data files are distributed by this project: JIS X 0213-2012, which extends JIS X 0208, overlaps with some of JIS X 0212, and adds 952 additional kanji.JIS X 0212-1990, which includes extra 5,801 kanji. ![]() It covers the 13,108 kanji in three main Japanese standards: ![]() The KANJIDIC project, which began in 1991, has the goal of compiling and distributing comprehensive information on the kanji used in Japanese text processing. ![]() (Note that this page in the process of being rewritten, so be patient with any aspects that seems incomplete.) Introduction ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |