org.knowceans.corpus.parsers.dpa
Class DpaDocument

java.lang.Object
  extended by org.knowceans.corpus.ADocument
      extended by org.knowceans.corpus.parsers.dpa.DpaDocument

public class DpaDocument
extends ADocument

XptDocument

Author:
heinrich

Field Summary
 java.lang.String abz
          <abz>?
 java.lang.String dzg
          <dzg>time date </dzg>
 java.lang.String mid
          <lfd_nr>message no.
 java.lang.String mko
          <mko>?
 java.util.Vector<java.lang.Integer> paragraphIndex
          Indices into txt where paragraphs start.
 java.util.Vector<java.lang.Integer> sco
          <sco>dpa 40-topic categories (multiple; 4-letter German ids) </sco>
 java.util.Vector<java.lang.String> sub
          <sub>subtitle </sub>
 java.lang.String swz
          <swz>keywords </swz>
 java.lang.String vkn
          <vkn>?
 
Fields inherited from class org.knowceans.corpus.ADocument
key, sentenceIndex, txt, ueb
 
Constructor Summary
DpaDocument()
           
 
Method Summary
 boolean isValid()
          if the document has enough information to be valid, i.e., have at least a text body and key.
 
Methods inherited from class org.knowceans.corpus.ADocument
toString
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
 

Field Detail

mid

public java.lang.String mid
<lfd_nr>message no. </lfd_nr>


vkn

public java.lang.String vkn
<vkn>? </vkn>


mko

public java.lang.String mko
<mko>? </mko>


sco

public java.util.Vector<java.lang.Integer> sco
<sco>dpa 40-topic categories (multiple; 4-letter German ids) </sco>


swz

public java.lang.String swz
<swz>keywords </swz>


sub

public java.util.Vector<java.lang.String> sub
<sub>subtitle </sub>


paragraphIndex

public java.util.Vector<java.lang.Integer> paragraphIndex
Indices into txt where paragraphs start.


abz

public java.lang.String abz
<abz>? </abz>


dzg

public java.lang.String dzg
<dzg>time date </dzg>

Constructor Detail

DpaDocument

public DpaDocument()
Method Detail

isValid

public boolean isValid()
if the document has enough information to be valid, i.e., have at least a text body and key.

Returns: