Author Archives: Tal Rotbart

Those of you familiar with the SpringSense story will be aware that until recently, SpringSense held the title of the world’s most accurate noun-sense disambiguator, and delivered this accuracy in real-time.

In the July/August 2012 issue of IEEE Intelligent System, an academic paper from P. Chen, C. Bowes (Uni. of Houston) and W. Ding, M. Choly (Uni. of Massachusetts, Boston) described a technique that was able to surpass the accuracy of version 1.0 of our technology. Whilst unable to perform the task quickly enough to be useful to enterprise, we still didn’t like the idea of being number 2, so we set about reclaiming the title.

We still didn’t like the idea of being number two, so we set about reclaiming the title

The mission we charged our chief scientist with, was to do whatever it took to regain top position; Fred Rotbart, PhD, was told nothing was off-limits. In our early sessions as we investigated the possibilities, it quickly became clear that the basis of our existing approach was valid as it offered us real-time speed, something we couldn’t sacrifice if we wanted a solution that could be used by our customers in enterprise.

A way forward presented itself; our innovative approach to NLP was valid, but we needed to re-visit our implementation. Dr Rotbart proceeded to pull apart our algorithm, and put it back together well oiled and with less cruft, which edged us closer to our goal. Our big breakthrough though, came after a moment of insight from Dr Rotbart, leading him to find an alternative and more accurate way of using the results of our data-mining algorithm to perform the noun-sense disambiguation.

The result was an increase in accuracy to a world leading 83.4%, as measured by the industry and academic benchmark SemEval 4 (task 7), without sacrificing any of the performance that allows SpringSense to be used for high volume transactional usage, such as Big Data and enterprise search.

Being overtaken by a more accurate solution was a useful learning experience for us as a team. What we learned from the journey was that to be useful to our customers, a solution needs to work in real-time without sacrificing accuracy. Our mission here in the SpringSense team remains to lead the world in both speed and accuracy.

The new version with the accuracy improvements is already live on the Mashape API Hub, we’d love for you to try it out and give us your feedback; a free plan is offered for your evaluation. Bindings are available for Ruby, Python, Java and ElasticSearch and more.

Melbourne Solr/Lucene Users Group Logo

Melbourne Solr/Lucene Users Group Logo

Here at SpringSense we’ve noticed that there is a blossoming of Apache Solr/Lucene usage and development in Melbourne, but we’re missing an unofficial, relaxed gathering to allow some fruitful information and experience exchange.

We’re trying to put together a laid back meet up for developers (and other interested people) who are currently using Apache Solr (and/or Lucene) or would like to learn more about it. Aiming for it to be a high signal/noise ratio group, with meet ups probably once every two months.

The first meet up is still TBD, but please join the group if you’re keen to join us for pizza, beer, and a discussion about Solr once we figure out the date of the first meeting.

Also, please feel free to suggest quick (15 minute) presentations - whether it be a problem you’ve solved, a problem you need help solving or a general interesting experience of using Solr.

We’re keeping registrations here: http://www.meetup.com/melbourne-solr/

Feel free to pass to co-workers, colleagues who would be interested.

Back from the Big Apple!

Well after some VERY long flights we’re back from a great week in New York City and the successful launch of SpringSense at Enterprise Search Summit Spring 2011. It was great to meet and talk to so many search professionals and share our passion for a better way to search.

We had some productive meetings and have some exciting developments in the pipeline; all we can say for the moment is watch this space.

Highlights from the conference include meeting Eric Reiss after his keynote speech, “The Dumbing Down Of Intelligent Search”. Hopefully the organisers will post a copy of the presentation online- we’ll put up a link to it when they do.

And finally a big congratulations to Robert Boeri on winning the 24″ LCD HDTV giveaway; it couldn’t have gone to a nicer bloke.

Our booth at the Enterprise Search Summit 2011

Our booth at the Enterprise Search Summit 2011

It’s the day before, and our humble booth is all set-up at the Enterprise Search Summit in New York City. If you’re around we’d love to see you and have a chat!

I’m very happy to confirm that we are Gold Sponsors for the Spring 2011 Enterprise Search Summit in New York City on May 10-11th — come visit us at booth 19.