Last week I had the privilege of organising the 13th meeting of the London Text Analytics group, which featured two excellent speakers: Despo Georgiou of Atos SE and Diana Maynard of Sheffield University. Despo’s talk described her internship at UXLabs where she compared a number of tools for analysing free-text survey responses (namely TheySay, Semantria, Google Prediction API and Weka). Diana’s talk focused on sentiment analysis applied to social media, and entertained the 70+ audience with all manner of insights based on her expertise of having worked on the topic for longer than just about anyone I know. Well done to both speakers!
Archive for the ‘Text analytics’ Category
Posted in Search, Text analytics, User experience, tagged Information Retrieval, Information seeking, Information visualization, log analysis, search logs, search strategies, User segmentation on June 3, 2014| Leave a Comment »
In a previous post I discussed some initial investigations into the use of unsupervised learning techniques (i.e. clustering) to identify usage patterns in web search logs. As you may recall, we had some initial success in finding interesting patterns of user behaviour in the AOL log, but when we tried to extend this and replicate a previous study of the Excite log, things started to go somewhat awry. In this post, we investigate these issues, present the results of a revised procedure, and reflect on what they tell us about searcher behaviour.
Posted in Search, Text analytics, User experience, tagged Information Retrieval, Information seeking, Information visualization, log analysis, search logs, search strategies, User segmentation on April 24, 2014| 5 Comments »
As I mentioned in a previous post I’ve recently been looking into the challenges of search log analysis and in particular the prospects for deriving a ‘taxonomy of search sessions’. The idea is that if we can find distinct, repeatable patterns of behaviour in search logs then we can use these to better understand user needs and therefore deliver a more effective user experience.
We’re not the first to attempt this of course – in fact the whole area of search log analysis has an academic literature which extends back at least a couple of decades. And it is quite topical right now, with both ElasticSearch and LucidWorks releasing their own logfile analysis tools (ELK and SiLK respectively). So in this post I’ll be discussing some of the challenges in our own work and sharing some of the initial findings.
Posted in Search, Text analytics, User experience, tagged Information Retrieval, Information seeking, Information visualization, log analysis, search logs, search strategies, User segmentation on March 18, 2014| 1 Comment »
Over the last few months I have been working with Paul Clough and Elaine Toms of Sheffield University on a Google-funded project called ‘A Taxonomy of Search Sessions’. A session, in case you’re wondering, is defined as a period of continued usage between a user and a search application. So if you spend a while Googling for holiday destinations, that’s a session. Sessions are interesting because they form a convenient unit of interaction with which to study usage patterns, and these can provide insights that drive improved design and functionality.
After a brief hiatus I’m pleased to say the London Text Analytics meetup resumed last night with an excellent set of talks from the participants in the AnnoMarket project. For those of you unfamiliar, this project is concerned with creating a cloud-based, open market for text analytics applications: a kind of NLP ‘app store’, if you will. The caveat is that each app must be implemented as a GATE pipeline and conform to their packaging constraints, but as we’ve discussed before, GATE is a pretty flexible platform that integrates well with 3rd party applications and services.
I have an intern who will shortly be starting a project to extract sentiment from free text survey responses from the healthcare domain. She doesn’t have much programming experience, so is ideally looking for a toolkit /platform that will allow her to experiment with various approaches with minimal coding (e.g. perhaps just some elementary scripting etc.).
Free is best, although a commercial product on a trial basis might work. Any suggestions?
- How do you compare two text classifiers?
- Text Analytics Summit Europe – highlights and reflections
- How do you measure site search quality?
- Prostitutes Appeal to Pope: Text Analytics applied to Search
- The role of Natural Language Processing in Information Retrieval