You're discussing part-of-speech tagging of natural language here, which is considered (almost) "solved" and for which very robust programs already exist. I linked to some part of speech...
Type: Posts; User: KONI
You're completely forgetting the fact that natural language is by definition ambiguous.
Sometimes writers don't want you to know whether something is a verb or a noun, and they make full use of the two...
Let's throw a bit of theory into this, shall we?
Computational processing of textual data
Interesting parts:
Part-of-Speech tagging (Brill, HMM)
Part-of-Speech tagging (HMM ctnd.)
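
To make the HMM part a bit more concrete, here is a minimal sketch of Viterbi decoding for a toy tagger. It is not taken from the linked material: the tag set, the vocabulary and every probability are invented for illustration, and the names (`viterbi`, `safe_log`, `START`, `TRANS`, `EMIT`) are hypothetical. A real tagger would estimate these tables from an annotated corpus and would need smoothing for unseen words.

```python
import math

# Toy HMM. All numbers are made up for illustration, not estimated from data.
TAGS = ["DET", "NOUN", "VERB"]

# P(first tag)
START = {"DET": 0.6, "NOUN": 0.3, "VERB": 0.1}

# P(next tag | previous tag)
TRANS = {
    "DET":  {"DET": 0.05, "NOUN": 0.9, "VERB": 0.05},
    "NOUN": {"DET": 0.1,  "NOUN": 0.3, "VERB": 0.6},
    "VERB": {"DET": 0.5,  "NOUN": 0.4, "VERB": 0.1},
}

# P(word | tag); "walk" is ambiguous between NOUN and VERB
EMIT = {
    "DET":  {"the": 0.9, "a": 0.1},
    "NOUN": {"dogs": 0.4, "walk": 0.3, "park": 0.3},
    "VERB": {"walk": 0.6, "park": 0.4},
}


def safe_log(p):
    """Log-probability, with log(0) mapped to -inf instead of raising."""
    return math.log(p) if p > 0 else float("-inf")


def viterbi(words):
    """Return the most probable tag sequence for a non-empty list of words."""
    # best[i][t]: log-probability of the best path for words[:i+1] ending in tag t
    best = [{t: safe_log(START[t]) + safe_log(EMIT[t].get(words[0], 0.0))
             for t in TAGS}]
    back = []  # back[i][t]: best previous tag when words[i+1] is tagged t
    for word in words[1:]:
        scores, pointers = {}, {}
        for t in TAGS:
            prev = max(TAGS, key=lambda p: best[-1][p] + safe_log(TRANS[p][t]))
            scores[t] = (best[-1][prev] + safe_log(TRANS[prev][t])
                         + safe_log(EMIT[t].get(word, 0.0)))
            pointers[t] = prev
        best.append(scores)
        back.append(pointers)
    # Follow the back-pointers from the best final tag
    path = [max(TAGS, key=lambda t: best[-1][t])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return list(reversed(path))


print(viterbi(["the", "walk"]))   # ['DET', 'NOUN']
print(viterbi(["dogs", "walk"]))  # ['NOUN', 'VERB']
```

The same word "walk" comes out as a noun after "the" and as a verb after "dogs", which is the kind of context-based disambiguation an HMM tagger is meant to do. It picks the single most probable reading, so deliberately ambiguous wording still gets forced into one tag.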