a _bL&@sdZddlmZmZddlmZGdddeZedddgd d d gd ed ddgdddgd edddgdddgd edddddgddgd edddgd d!d"gd ed#d$dd%d&gd'd(gd ed)d*dd+d,gd-d.gd ed/d0dd%d&gd1gd2ed3d4dd5gd6d7gd ed8d9dgd d:d;gd edd?d@gd edAdBdgd dCdDgd edEdFdgd dGdHdIgd edJdKddLdMgdNdOgd edPdQdddgdRdSgd edTdUdddgdVdWgd edXdYdgd dZd[gd ed\d]dgd^d_d`gd edadbdgd^dcddgd ededfdgdgdhdigd edjdkdd%d&gdlgd2edmdndddgdodpgd edqdrdgd dsdtgd edudvdddgdwdxgd edydzdgd{d|d}gd ed~ddddgddgd edddddgddgd edddgdgdgdedddgdddgd edddgdddgd eddddgddgd dZdS)z Metadata about languages used by our model training code for our SingleByteCharSetProbers. Could be used for other things in the future. This code is based on the language metadata from the uchardet project. )absolute_importprint_function) ascii_letterscs*eZdZdZdfdd ZddZZS) LanguageaMetadata about a language useful for training models :ivar name: The human name for the language, in English. :type name: str :ivar iso_code: 2-letter ISO 639-1 if possible, 3-letter ISO code otherwise, or use another catalog as a last resort. :type iso_code: str :ivar use_ascii: Whether or not ASCII letters should be included in trained models. :type use_ascii: bool :ivar charsets: The charsets we want to support and create data for. :type charsets: list of str :ivar alphabet: The characters in the language's alphabet. If `use_ascii` is `True`, you only need to add those not in the ASCII set. :type alphabet: str :ivar wiki_start_pages: The Wikipedia pages to start from if we're crawling Wikipedia for training data. :type wiki_start_pages: list of str NTcsrtt|||_||_||_||_|jr@|r:|t7}qLt}n |sLtd|rbd t t |nd|_ ||_ dS)Nz*Must supply alphabet if use_ascii is False)superr__init__nameiso_code use_asciicharsetsr ValueErrorjoinsortedsetalphabetwiki_start_pages)selfr r r r rr __class__A/tmp/pip-target-4jja1joz/lib/python/chardet/metadata/languages.pyr$s zLanguage.__init__cCs&d|jjddd|jDS)Nz{}({})z, css(|] \}}|dsd||VqdS)_z{}={!r}N) startswithformat).0kvrrr 7s z$Language.__repr__..)rr__name__r__dict__items)rrrr__repr__5s   zLanguage.__repr__)NNTNNN)r __module__ __qualname____doc__rr" __classcell__rrrrrs rArabicarF)z ISO-8859-6z WINDOWS-1256ZCP720ZCP864ubءآأؤإئابةتثجحخدذرزسشصضطظعغػؼؽؾؿـفقكلمنهوىيًٌٍَُِّuالصفحة_الرئيسية)r r r r rr Belarusianbe) ISO-8859-5 WINDOWS-1251IBM866 MacCyrillicuАБВГДЕЁЖЗІЙКЛМНОПРСТУЎФХЦЧШЫЬЭЮЯабвгдеёжзійклмнопрстуўфхцчшыьэюяʼu!Галоўная_старонка Bulgarianbg)r+r,IBM855uxАБВГДЕЖЗИЙКЛМНОПРСТУФХЦЧШЩЪЬЮЯабвгдежзийклмнопрстуфхцчшщъьюяuНачална_страницаCzechczTz ISO-8859-2z WINDOWS-1250u<áčďéěíňóřšťúůýžÁČĎÉĚÍŇÓŘŠŤÚŮÝŽuHlavní_stranaDanishda) ISO-8859-1z ISO-8859-15 WINDOWS-1252u æøåÆØÅZForsideGermander6r7uäöüßÄÖÜzWikipedia:HauptseiteGreekelz ISO-8859-7z WINDOWS-1253uαβγδεζηθικλμνξοπρσςτυφχψωάέήίόύώΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΣΤΥΦΧΨΩΆΈΉΊΌΎΏuΠύλη:ΚύριαEnglishenZ Main_Page)r r r r r Esperantoeo ISO-8859-3uDabcĉdefgĝhĥijĵklmnoprsŝtuŭvzABCĈDEFGĜHĤIJĴKLMNOPRSŜTUŬVZuVikipedio:ĈefpaĝoSpanishesuñáéíóúüÑÁÉÍÓÚÜzWikipedia:PortadaEstonianet) ISO-8859-4 ISO-8859-13 WINDOWS-1257u6ABDEGHIJKLMNOPRSTUVÕÄÖÜabdeghijklmnoprstuvõäöüZEsilehtFinnishfiuÅÄÖŠŽåäöšžzWikipedia:EtusivuFrenchfru,œàâçèéîïùûêŒÀÂÇÈÉÎÏÙÛÊuWikipédia:Accueil_principaluBœuf (animal)Hebrewhez ISO-8859-8z WINDOWS-1255u<אבגדהוזחטיךכלםמןנסעףפץצקרשתװױײuעמוד_ראשיCroatianhru@abcčćdđefghijklmnoprsštuvzžABCČĆDĐEFGHIJKLMNOPRSŠTUVZŽZGlavna_stranica HungarianhuuPabcdefghijklmnoprstuvzáéíóöőúüűABCDEFGHIJKLMNOPRSTUVZÁÉÍÓÖŐÚÜŰu KezdőlapItalianituÀÈÉÌÒÓÙàèéìòóùZPagina_principale Lithuanianlt)rFrGrEuRAĄBCČDEĘĖFGHIĮYJKLMNOPRSŠTUŲŪVZŽaąbcčdeęėfghiįyjklmnoprsštuųūvzžZPagrindinis_puslapisLatvianlvuXAĀBCČDEĒFGĢHIĪJKĶLĻMNŅOPRSŠTUŪVZŽaābcčdeēfgģhiījkķlļmnņoprsštuūvzžu Sākumlapa Macedonianmk)r+r,r.r1u|АБВГДЃЕЖЗЅИЈКЛЉМНЊОПРСТЌУФХЦЧЏШабвгдѓежзѕијклљмнњопрстќуфхцчџшuГлавна_страницаDutchnlZ HoofdpaginaPolishpluRAĄBCĆDEĘFGHIJKLŁMNŃOÓPRSŚTUWYZŹŻaąbcćdeęfghijklłmnńoóprsśtuwyzźżuWikipedia:Strona_główna Portugueseptu0ÁÂÃÀÇÉÊÍÓÔÕÚáâãàçéêíóôõúuWikipédia:Página_principalRomanianrouăâîșțĂÂÎȘȚuPagina_principalăRussianru)r+r,zKOI8-Rr.r-r1uабвгдеёжзийклмнопрстуфхцчшщъыьэюяАБВГДЕЁЖЗИЙКЛМНОПРСТУФХЦЧШЩЪЫЬЭЮЯu#Заглавная_страницаSlovakskuDáäčďéíĺľňóôŕšťúýžÁÄČĎÉÍĹĽŇÓÔŔŠŤÚÝŽuHlavná_stránkaSloveneslu8abcčdefghijklmnoprsštuvzžABCČDEFGHIJKLMNOPRSŠTUVZŽZ Glavna_stranSerbiansruxАБВГДЂЕЖЗИЈКЛЉМНЊОПРСТЋУФХЦЧЏШабвгдђежзијклљмнњопрстћуфхцчџшuГлавна_страна)r r rr rThaith)z ISO-8859-11zTIS-620ZCP874uกขฃคฅฆงจฉชซฌญฎฏฐฑฒณดตถทธนบปผฝพฟภมยรฤลฦวศษสหฬอฮฯะัาำิีึืฺุู฿เแโใไๅๆ็่้๊๋์ํ๎๏๐๑๒๓๔๕๖๗๘๙๚๛uหน้าหลักTurkishtr)r@z ISO-8859-9z WINDOWS-1254uRabcçdefgğhıijklmnoöprsştuüvyzâîûABCÇDEFGĞHIİJKLMNOÖPRSŞTUÜVYZÂÎÛZ Ana_Sayfa Vietnameseviz WINDOWS-1258uHaăâbcdđeêghiklmnoôơpqrstuưvxyAĂÂBCDĐEÊGHIKLMNOÔƠPQRSTUƯVXYuChữ_Quốc_ngữ)r'r)r/r2r4r8r:r<r>rArCrHrJrLrNrPrRrTrVrXrZr\r^r`rbrdrfrhrjrlrnN) r% __future__rrstringrobjectrZ LANGUAGESrrrrs ,