view results/tld_nonlatin_2019-35.tsv @ 246:666069efb0c6

output bytes, pickle and save dict if -p, trim lm value to int
author Henry S. Thompson <ht@inf.ed.ac.uk>
date Thu, 02 Jan 2025 14:51:00 +0000
parents c15cef19b584

3	ru	ru	Cyrl	Cyrillic
7	jp	ja	Jpan	Japanese (alias for Han + Hiragana + Katakana)
8	cn	zh	Hant	Han (Traditional variant)
20	ua	uk	Cyrl	Cyrillic
23	tw	zh	Hant	Han (Traditional variant)
30	gr	el	Grek	Greek
35	kr	ko	Kore	Korean (alias for Hangul + Han)
39	ir	fa	Arab	Arabic
42	il	he	Hebr	Hebrew
53	by	be	Cyrl	Cyrillic
59	bg	bg	Cyrl	Cyrillic
63	cc	ms	Arab	Arabic
67	kz	kk	Arab	Arabic
72	my	ms	Arab	Arabic
75	hk	zh	Hant	Han (Traditional variant)
76	th	th	Thai	Thai
83	ge	ka	Geor	Georgian (Mkhedruli and Mtavruli)
88	pk	ur	Arab	Arabic
92	am	hy	Armn	Armenian
100	ae	ar	Arab	Arabic
113	ma	ar	Arab	Arabic
114	mk	mk	Cyrl	Cyrillic
118	lk	si	Sinh	Sinhala
123	mn	mn	Cyrl	Cyrillic
126	kg	ky	Cyrl	Cyrillic
127	ly	ar	Arab	Arabic
128	sa	ar	Arab	Arabic
130	bd	bn	Beng	Bengali (Bangla)
133	la	lo	Laoo	Lao
138	cy	el	Grek	Greek
144	eg	ar	Arab	Arabic
148	tn	ar	Arab	Arabic
153	np	ne	Deva	Devanagari (Nagari)
161	ps	ar	Arab	Arabic
162	lb	ar	Arab	Arabic
184	dz	ar	Arab	Arabic
187	tj	tg	Cyrl	Cyrillic
194	qa	ar	Arab	Arabic
199	kh	km	Khmr	Khmer
207	mo	zh	Hant	Han (Traditional variant)