VOYAGE GROUP
東京都渋谷区神泉町8-16 渋谷ファーストプレイス8F
Registration is closed
Grab a drink and catch up with your fellow Rubyists.
This presentation will cover 3 natural language processing gems I’ve released over the past year:
* Pragmatic Segmenter (a sentence boundary detection gem)
* Chat Correct (a gem for English teachers/students that provides error analysis when an incorrect sentence is diffed with a correct sentence)
* Word Count Analyzer (a gem that analyzes a string for potential “word count gray areas” which cause tools to report different word counts)
The talk will cover various aspects of building these gems including working from first principles, testing edge cases, and getting comfortable with regular expressions. I’ll also introduce a project that is currently in-progress - a new algorithm for parallel text alignment and some of the related challenges with building it.
I am an American living in Japan interested in the intersection of Natural Language Processing (NLP), web development, and the translation industry. I’m currently a developer at TM-Town, a new translation enablement platform that matches professional translators with clients based on the translator's prior work. Before TM-Town I built an online CAT (Computer-assisted translation) Tool called Transdraft which is aimed at making CAT software easy and accessible for freelance translators. I have also developed a wide range of other applications and sites including a language acquisition application, a language chat correction application, as well as an alumni networking application.
Github: https://github.com/diasks2
Twitter: https://twitter.com/diasks2
Discuss the presentations or anything else Ruby related with the other attendees.
Tokyo Rubyist Meetup (trbmeetup) is an event that seeks to help bridge the Japan and international ruby and ruby on rails community. It will hold regular meetings where Japanese Rubyists can commun...
Join community