Computational Linguistics

Duplicating -tagSeparator effect when using Stanford Parser programmatically

I have been using Stanford NLP Parser from command line with -tagSeparator flag to supply it with partially tagged input. As the parser seems to be really bad with date expressions and complex name entities, I need this functionality. Now, I need to wrap-up the parser in my own code to add input/output batching and I discover that this option is not accepted when constructing parser from the code. Despite javadoc saying that LexicalizedParser.

I received the Digger beta invite

This was the fastest beta invite confirmation ever. Unfortunately, Digger’s Terms of Service do not allow any sort of disclosure about features or results from it. This is very different from Powerset which has been going out of its way to get beta subscribers (even unconfirmed ones) to know what they are doing. Digger does not even seem to have a blog, which contradicts the rules for a web2.0 company.

Digger – Another NLP enhanced search engine (beta)

Powerset hasn’t even started competing with Google yet and already it has its own competitor. [Powerset hasn’t even started competing with Google yet and already it has its own competitor. ]1 - which is currently in private beta - does sense disambiguation of the search terms like everybody else. Unlike everybody else, however, they expose the underlying WordNet definitions to the searcher and allow them to pick, rate and even discuss the senses a la Digg or maybe Search Wikia concept.

Another large syntax tree visualiser

I found another online syntax tree visualiser that can cope with large trees - phpSyntaxTree. It requires square brackets instead of the lisp s-expression ones, but it should not be too hard to convert from one to another. There is also a Ruby version of the application from a different developer, but it refused to display one of my large sentences that the original had no problems with. The result of the parse is an image and is fairly easy to read with the leaves (words) laid out in strata (stratas?

Learning english prepositions – the smart way

In my review of WordChamp and LingQ I mentioned that an ideal language learning system would have deep support for the specifics of the learner’s target language. I was asked to clarify what I mean by that. I have now found an example of what could be a step in the right direction. It is an online research system called WERTi from the Computational Linguistics and Language Technology group at The Ohio State University.