B ¨#aöã@s0ddlZddlZddlmZGdd„deƒZdS)éNé)Ú ProbingStatec@sneZdZdZddd„Zdd„Zedd„ƒZd d „Zed d „ƒZ d d„Z e dd„ƒZ e dd„ƒZ e dd„ƒZdS)Ú CharSetProbergffffffî?NcCsd|_||_t t¡|_dS)N)Ú_stateÚ lang_filterÚloggingÚ getLoggerÚ__name__Úlogger)Úselfr©r úd}n |dkrJd}|dkr| ¡s||kr‚|s‚| |||…¡| d¡|d}qW|s¤| ||d …¡|S) aÈ Returns a copy of ``buf`` that retains only the sequences of English alphabet and high byte characters that are not between <> characters. Also retains English alphabet and high byte characters immediately before occurrences of >. This filter can be applied to all scripts which contain both English characters and extended ASCII characters, but is currently only used by ``Latin1Prober``. Frró>ós