  • 1
    E-Resource
    Philadelphia, Pa. : Linguistic Data Consortium
    UID: (DE-627)1600977596
    Format: 1 DVD-ROM, 4 3/4 in
    ISBN: 9781585634866, 1585634867
    Uniform Title: New York times
    Content: "The New York Times Corpus contains over 1.8 million articles written and published by the New York Times between January 1, 1987 and June 19, 2007 with article metadata provided by the New York Times Newsroom, the New York Times Indexing Service and the online production staff at nytimes.com. The corpus includes: over 1.8 million articles (excluding wire services articles that appeared during the covered period); over 650,000 article summaries written; over 1,500,000 articles manually tagged by library scientists with tags drawn from a normalized indexing vocabulary of people, organizations, locations and topic descriptors; over 275,000 algorithmically-tagged articles that have been hand verified by the online production staff at nytimes.com; Java tools for parsing corpus documents from .xml into a memory resident object. As part of the New York Times' indexing procedures, most articles are manually summarized and tagged by a staff of library scientists. This collection contains over 650,000 article-summary pairs which may prove to be useful in the development and evaluation of algorithms for automated document summarization. Also, over 1.5 million documents have at least one tag. Articles are tagged for persons, places, organizations, titles and topics using a controlled vocabulary that is applied consistently across articles."--Index HTML document
    Note: Title from disc label. - LDC2008T19
    Language: English
    Keywords: Hochschulschrift ; DVD-ROM