uk.ac.gla.dcs.renaissance.iqir.in
Class PureINExtractor

java.lang.Object
  extended by uk.ac.gla.dcs.renaissance.iqir.in.PureINExtractor
Direct Known Subclasses:
BoundaryINExtractor, SlidingWindowINExtractor

public abstract class PureINExtractor
extends Object

Given a compact representation of a document, get its pure IN fragments

Author:
B. Piwowarski

Constructor Summary
PureINExtractor()
           
 
Method Summary
abstract  Iterator<bpiwowar.maths.matrix.DoubleMatrix1D> getIterator(CompactDocument document)
          Get an iterator for the document pure information need aspects
abstract  Iterator<bpiwowar.maths.matrix.DoubleMatrix1D> getIterator(CompactDocument document, Iterator<it.unimi.dsi.util.Interval> intervalIterator)
          Extract pure INs using an interval iterator
 void init(IndexConfiguration index, CompactDocumentStorage storage)
          Initialise the pure IN extractor
 Iterable<bpiwowar.maths.matrix.DoubleMatrix1D> vectors(CompactDocument document)
          Get the vectors for one document given its id
 Iterable<bpiwowar.maths.matrix.DoubleMatrix1D> vectors(CompactDocument document, it.unimi.dsi.util.Interval... interval)
           
 Iterable<bpiwowar.maths.matrix.DoubleMatrix1D> vectors(CompactDocument document, Iterator<it.unimi.dsi.util.Interval> intervalIterator)
          Get the vectors for one document and a term given its id
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

PureINExtractor

public PureINExtractor()
Method Detail

init

public void init(IndexConfiguration index,
                 CompactDocumentStorage storage)
Initialise the pure IN extractor


vectors

public final Iterable<bpiwowar.maths.matrix.DoubleMatrix1D> vectors(CompactDocument document)
Get the vectors for one document given its id

Parameters:
document - The document
Returns:
An iterable on the list of (non normalised) term vectors
Throws:
Exception - When an exception occurs

vectors

public final Iterable<bpiwowar.maths.matrix.DoubleMatrix1D> vectors(CompactDocument document,
                                                                    Iterator<it.unimi.dsi.util.Interval> intervalIterator)
Get the vectors for one document and a term given its id

Parameters:
document - The document
Returns:
An iterable on the list of (non normalised) term vectors
Throws:
Exception - When an exception occurs

getIterator

public abstract Iterator<bpiwowar.maths.matrix.DoubleMatrix1D> getIterator(CompactDocument document)
Get an iterator for the document pure information need aspects

Parameters:
document - The document from which we want the aspects
Returns:
an iterator over vectors in the index term space

getIterator

public abstract Iterator<bpiwowar.maths.matrix.DoubleMatrix1D> getIterator(CompactDocument document,
                                                                           Iterator<it.unimi.dsi.util.Interval> intervalIterator)
Extract pure INs using an interval iterator


vectors

public Iterable<bpiwowar.maths.matrix.DoubleMatrix1D> vectors(CompactDocument document,
                                                              it.unimi.dsi.util.Interval... interval)


Copyright © 2011. All Rights Reserved.