Sign In

Communications of the ACM

ACM TechNews

Democratizing Data Science

View as: Print Mobile App Share:
The tool addresses a bottleneck in the data science field.

Researchers at the Massachusetts Institute of Technology hope to advance the democratization of data science with a new tool for non-programmers that automatically generates models for analyzing raw data.

Credit: Christine Daniloff/MIT

Massachusetts Institute of Technology (MIT) researchers have developed a tool for nonstatisticians that automatically generates models for analyzing raw data.

The tool takes in datasets and generates sophisticated statistical models normally used by experts to analyze, interpret, and predict underlying patterns in data.

The tool currently resides on Jupyter Notebook, an open source Web framework that allows users to run programs interactively in browsers; users can write just a few lines of code to uncover insights into a range of topics.

The system uses Bayesian modeling, a statistical method that continuously updates the probability of a variable as more information about the variable becomes available.

The tool uses a modified version of "program synthesis," a technique that automatically creates computer programs given data and a language to work within.

Said MIT’s Feras Saad, “People have a lot of datasets that are sitting around, and our goal is to build systems that let people automatically get models they can use to ask questions about that data.”

From MIT News
View Full Article


Abstracts Copyright © 2019 SmithBucklin, Washington, DC, USA


No entries found