|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectuk.ac.gla.dcs.renaissance.util.WarcHTMLResponseRecord
public class WarcHTMLResponseRecord
Constructor Summary | |
---|---|
WarcHTMLResponseRecord()
Default constructor |
|
WarcHTMLResponseRecord(WarcHTMLResponseRecord o)
Copy constructor |
|
WarcHTMLResponseRecord(WarcRecord o)
Constructor creation from a generic WARC record |
Method Summary | |
---|---|
String |
getHTMLContent()
|
WarcRecord |
getRawRecord()
|
long |
getStartMarker()
Gets the start marker, the position in the stream where the WARC record starts. |
long |
getStopMarker()
Gets the stop marker, the position in the stream where the WARC record end. |
int |
getStopMarkerDiff()
Gets the difference between the start and stop marker |
String |
getTargetTrecID()
|
String |
getTargetURI()
|
Vector<String> |
getURLOutlinks()
Gets a vector of normalized URLs (normalized to this target URI) of the outlinks of the page |
boolean |
isHTMLResponse()
Test if the underlying record is really a HTML response. |
void |
setRecord(WarcRecord o)
|
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public WarcHTMLResponseRecord()
public WarcHTMLResponseRecord(WarcHTMLResponseRecord o)
o
- public WarcHTMLResponseRecord(WarcRecord o)
o
- Method Detail |
---|
public void setRecord(WarcRecord o)
public boolean isHTMLResponse()
true
if record is an HYTML response,
false
otherwisepublic WarcRecord getRawRecord()
public String getTargetURI()
public String getTargetTrecID()
public Vector<String> getURLOutlinks()
public int getStopMarkerDiff()
public long getStopMarker()
public long getStartMarker()
public String getHTMLContent()
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |