I am not someone who have worked much on Java and so when the openNLP tools(www.opennlp.com) were in java i knew i was going to have some difficult time. On top of this the documentation for this is almost non-existent. After spending some time on the net I came across this nice article from Daniel which had code snippets to get started on NLP. He has very well explained the basic starting steps and there is no point in me repeating the steps. This is the link to Daniel's blog.
After some firefighting I have finally been able to write a sample code which can complement steps provided by Daniel.I have uploaded the source code of the same.It contains all required external libraries including maxent, opennlp-tools, opennlp and trove. Also I have included required models to parse english sentences. You can download the code from here
Wednesday, February 18, 2009
Subscribe to:
Post Comments (Atom)
8 comments:
i hope this article would inspire me in making my thesis softwarwe...which is one week from now...
Hey thx for ur effort, and for referring to the other blog....
I tried going through Daniels blog as well as the source code u have been nice enough to put..... but i still couldnt get anything to run properly.....
in the stuff u have uploaded we just have to execute that run file is that correct ???????
I had some problem with your code. I have the memory heap problem that you posted on Daniels blog. I would greatly appreciate it if you could send me an e-mail at gio[at]ct.is with your solution to that problem
Many thanks.
I also got the same problem, however there is a way to fix this problem of memory by dedicating more memory through the JVM, have a look at this page, http://hausheer.osola.com/docs/5. Hope this will solve your problem.
Sorry guys I got a little busy and could not follow up with your comments. You just need to run the run.sh and it should work. It works for me and only extra stuff I had done is setting up of jdk/bin path in my bash profile. If you open the run.sh file you can notice the parameter -Xmx350m I have used to increase the heap size to do away with the memory heap problem.
Yeah, I figured it out at last. I am using eclipse and I finally found where to put in the environment variables to increase heap size memory, 256Mb was enough for me. :) But thanks. :)
Hi vikas, thnks for the code dump.
I was wondering if you hav tried using the PosTagger as well.Acc to Daniel's blog, the PosTagger cud also be used as a light weight soln. I cudnt find much info abt its usage.
It wud be helpful if you cud post some sample code in case you have it.
Thnks
Hi Gunni,
Even I had a problem with the code i.e the OOM exception. However, when i tried to run it via a cmd prompt, it worked fine.
But in eclipse it failed. Can you tell me what exactly u did to get it working?
Post a Comment