Loading…
Saturday, October 28 • 2:00pm - 2:22pm
The Power of Parallelism

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

As the size of datasets grows beyond the capabilities of even entire teams of humans to curate, there is a growing need to automate the categorization of records and removal of errors. This talk will discuss the advances in machine learning and the types of data processing pipelines that allow for the massive parallel processing of datasets to automatically clean and categorize even the largest of datasets.

 


Speakers
avatar for Aaron Levine

Aaron Levine

Research Scientist, Rakuten
Aaron Levine is a Research Scientist for the RIT-Boston group, focusing on massive distributed categorization of text. He obtained his masters from Brandeis University in 2015, where he worked on spatial attribute extraction and the UN ISO 24617-7 standard for spatial annotation... Read More →


Saturday October 28, 2017 2:00pm - 2:22pm JST
4F Rakuten A