ó <ŋCVc@sądZddlmZddlZddlmZddlTidd6dd 6d d 6d d 6dd6dd6ZeeƒZej de fd„ƒYƒZ de fd„ƒYZ dS(u$ Corpus reader for the Information Extraction and Entity Recognition Corpus. NIST 1999 Information Extraction: Entity Recognition Evaluation http://www.itl.nist.gov/iad/894.01/tests/ie-er/er_99/er_99.htm This corpus contains the NEWSWIRE development test data for the NIST 1999 IE-ER Evaluation. The files were taken from the subdirectory: /ie_er_99/english/devtest/newswire/*.ref.nwt and filenames were shortened. The corpus contains the following files: APW_19980314, APW_19980424, APW_19980429, NYT_19980315, NYT_19980403, and NYT_19980407. iĸĸĸĸ(tunicode_literalsN(tcompat(t*u&Associated Press Weekly, 14 March 1998u APW_19980314u&Associated Press Weekly, 24 April 1998u APW_19980424u&Associated Press Weekly, 29 April 1998u APW_19980429uNew York Times, 15 March 1998u NYT_19980315uNew York Times, 3 April 1998u NYT_19980403uNew York Times, 7 April 1998u NYT_19980407t IEERDocumentcBs&eZddddd„Zd„ZRS(ucCs1||_||_||_||_||_dS(N(ttexttdocnotdoctypet date_timetheadline(tselfRRRRR((si/private/var/folders/cc/xm4nqn811x9b50x1q_zpkmvdjlphkp/T/pip-build-FUwmDn/nltk/nltk/corpus/reader/ieer.pyt__init__-s     cCs“|jr$dj|jjƒƒ}nCdjg|jjƒD]}|d dkr:|^q:d ƒd}|jdk r‡d|j|fSd|SdS(Nu iuu(RtjointleavesRRtNone(R Rtw((si/private/var/folders/cc/xm4nqn811x9b50x1q_zpkmvdjlphkp/T/pip-build-FUwmDn/nltk/nltk/corpus/reader/ieer.pyt__repr__5s 'N(t__name__t __module__R R R(((si/private/var/folders/cc/xm4nqn811x9b50x1q_zpkmvdjlphkp/T/pip-build-FUwmDn/nltk/nltk/corpus/reader/ieer.pyR+stIEERCorpusReadercBsMeZdZdd„Zdd„Zdd„Zd„Zd„Zd„Z RS(u cCsb|dkr|j}nt|tjƒr6|g}ntg|D]}|j|ƒjƒ^q@ƒS(N(R t_fileidst isinstanceRt string_typestconcattopentread(R tfileidstf((si/private/var/folders/cc/xm4nqn811x9b50x1q_zpkmvdjlphkp/T/pip-build-FUwmDn/nltk/nltk/corpus/reader/ieer.pytrawCs   cCsAtg|j|tƒD]$\}}t||jd|ƒ^qƒS(Ntencoding(RtabspathstTruetStreamBackedCorpusViewt _read_block(R Rtfileidtenc((si/private/var/folders/cc/xm4nqn811x9b50x1q_zpkmvdjlphkp/T/pip-build-FUwmDn/nltk/nltk/corpus/reader/ieer.pytdocsHscCsAtg|j|tƒD]$\}}t||jd|ƒ^qƒS(NR(RRRRt_read_parsed_block(R RR!R"((si/private/var/folders/cc/xm4nqn811x9b50x1q_zpkmvdjlphkp/T/pip-build-FUwmDn/nltk/nltk/corpus/reader/ieer.pyt parsed_docsMscCsAg|j|ƒD]-}|j|ƒjdk r|j|ƒ^qS(N(R t_parseRR (R tstreamtdoc((si/private/var/folders/cc/xm4nqn811x9b50x1q_zpkmvdjlphkp/T/pip-build-FUwmDn/nltk/nltk/corpus/reader/ieer.pyR$SscCs?tjj|ddƒ}t|tƒr1t|St|ƒSdS(Nt root_labeluDOCUMENT(tnltktchunkt ieerstr2treeRtdictR(R R(tval((si/private/var/folders/cc/xm4nqn811x9b50x1q_zpkmvdjlphkp/T/pip-build-FUwmDn/nltk/nltk/corpus/reader/ieer.pyR&Xs cCsĒg}x6tr>|jƒ}|s%Pn|jƒdkr Pq q W|j|ƒxCtr‘|jƒ}|skPn|j|ƒ|jƒdkrOPqOqOWdj|ƒgS(Nuuu (RtreadlinetstriptappendR (R R'touttline((si/private/var/folders/cc/xm4nqn811x9b50x1q_zpkmvdjlphkp/T/pip-build-FUwmDn/nltk/nltk/corpus/reader/ieer.pyR _s       N( RRt__doc__R RR#R%R$R&R (((si/private/var/folders/cc/xm4nqn811x9b50x1q_zpkmvdjlphkp/T/pip-build-FUwmDn/nltk/nltk/corpus/reader/ieer.pyR@s     (R4t __future__RR*Rtnltk.corpus.reader.apittitlestsortedt documentstpython_2_unicode_compatibletobjectRt CorpusReaderR(((si/private/var/folders/cc/xm4nqn811x9b50x1q_zpkmvdjlphkp/T/pip-build-FUwmDn/nltk/nltk/corpus/reader/ieer.pyts