Package uk.ac.gla.dcs.renaissance.mg4j.trec

Class Summary
TRECBuildIndex  
TRECDocumentCollection A collection for the TREC data set.
TRECDocumentCollection.Match Useful to match a series of bytes
TRECDocumentCollection.Options  
TRECDocumentCollection.TRECDocumentDescriptor A compact description of the location and of the internal segmentation of a TREC document inside a file.
TRECDocumentFactory A factory that provides fields for body and title of HTML documents.
TRECSegmentedTextExtractor A callback extracting text and titles for TREC documents.
WARCDocumentCollection Managing TREC collections provided in a WARC format, as used for instance by the TREC session track.
 

Enum Summary
TRECDocumentCollection.Compression  
TRECDocumentFactory.CollectionType  
TRECDocumentFactory.Fields  
 



Copyright © 2011. All Rights Reserved.