How can I represent category of sentence predicted from Naive Bayes as a feature in CRF++ or Wapiti?
For instance, if the sentence, Tumblr merges with Yahoo., is classified as Business, then while composing the training file for crf, where can I indicate the label Business as a feature? And how should then the template be modeled?
Should the train file be like this
Tumblr business ORG
merges business O
with business O
Yahoo business ORG
Or only include the category with the ORG label? How so? And the template file?
Method 1: You can add
businessas a feature in the same way you have shown or you can simply write1instead ofbusiness. Similarly, for categorysportsyou can add another column and the value in this column shall be1for words belonging to sports sentence. You'll have to add each column in the template file too, respectively.Method 2: Including category with ORG might not be a good idea because the same ORG can appear in different categories.