About me

Currently I am the Lead Data Scientist at Popertee Ltd in Dublin, Ireland, leading the research and development of a location data platform for pop-up retail and experiential marketing campaigns.

Prior to this I was a Natural Language Processing Specialist at award-winning health-tech startup babylon in London, mostly concerned with  chatbots, information extraction and assorted applications of biomedical natural language processing.

I’m also an occasional contributor and editor at Wide Orbits, an online community dealing with ideas, issues and culture based in Dublin, Ireland where I’ve written about topics as diverse as computational creativity and Brazilian politics.

How it came to this

I completed my doctorate in the Computational Linguistics Group at Trinity College Dublin under the supervision of Dr. Carl Vogel in 2013 entitled: “Translation Effects in English Natural Language text”as part of the Centre for Next Generation Localisation (CNGL), an SFI-funded CSET research centre.

In my thesis I looked at questions related to stylometry and authorship in the context of  translations, applying machine learning techniques to tease out signals in literature.

After a brief foray deploying and training commercial information extraction systems in the Irish Times, I worked as a postdoctoral researcher at the Centre for Applied Data Analytics Research (CeADAR), based at University College Dublin, where I applied my knowledge of natural language processing to problems in social media analysis and social entity resolution. I also worked on graph analytics and data visualisation projects at CeADAR.

More information can be found on both LinkedIn and Twitter.

Over the years I’ve worked on diverse topics in text analytics such as:

  1. Applications of text analytics and machine learning to literary translations.
  2. investigating the stylistic patterns of character in dramatic works.
  3. Textual analysis of social media and news.
  4. Training and deploying commercial NER systems.
  5. Developing a prototype text alignment tool for translations.
  6. Quality estimation for translation.
  7. Identifying errors in non-native speaker English.
  8. Authorship attribution of translations.
  9. Diachronic classification of Latin and English text.
  10. Computational creativity and generation of apt titles for news articles.
  11. Biomedical entity and relation extraction.
  12. Biomedical natural language generation.
  13. Geolocation systems for short-term retail

Deutsch wird ab und zu gesprochen. Português também. Beagánín Gaeilge freisin.

Guitar at Yossarian Lives, drums At Speed.


Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s