CSAIL Event Calendar


Transforming Big Data with D4M

Speaker: Jeremy Kepner, MIT Lincoln Laboratory
Date: Friday, February 1, 2013
Time: 11:40AM to 12:40PM
Location: 32-123
Host: Alan Edelman, CSAIL
Contact: Shirley Entzminger, 3-4347, Daisymae@math.mit.edu

SPECIAL Joint Computational Research in Boston and Beyond Seminar and New England Database Summit Talk

(To order a lunch please register at: http://db.csail.mit.edu/nedbday13/)


ABSTRACT:

The growth of bioinformatics, social analysis, and network science is forcing data scientists to handle unstructured data in the form of genetic sequences, text, and graphs. Triple store databases are a key enabling technology for this data and are used by many large Internet companies (e.g., Google Bigtable, Amazon Dynamo, Apache HBase, and Apache Accumulo). Triple stores are highly scalable and run on commodity clusters, but they lack interfaces that support efficient development of the mathematical algorithms used by many data scientists. D4M (Dynamic Distributed Dimensional Data Model) provides a parallel linear algebraic interface to triple stores. Using D4M, it is possible to create composable analytics with significantly less effort than with traditional approaches. The central mathematical concept of D4M is the associative array, which combines spreadsheets, triple stores, and sparse linear algebra. Associative arrays are group-theoretic constructs that use fuzzy algebra to extend linear algebra to words and strings. This talk describes the D4M technology, its mathematical foundations, application, and performance.
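
To make the associative-array idea concrete, here is a minimal Python sketch. It is not D4M's interface: the class name `Assoc`, its methods, and the sample document/word keys are all hypothetical, chosen only to show how (row, column, value) triples with string keys can be added and multiplied like sparse matrices.

```python
from collections import defaultdict


class Assoc:
    """Toy associative array (hypothetical, not the D4M API):
    maps (row key, column key) string pairs to numeric values."""

    def __init__(self, triples=()):
        # Store only the nonzero entries, like a triple store or sparse matrix.
        # Repeated (row, col) keys are summed.
        self.data = {}
        for row, col, val in triples:
            self.data[(row, col)] = self.data.get((row, col), 0) + val

    def triples(self):
        # Return the contents as sorted (row, col, value) triples.
        return sorted((r, c, v) for (r, c), v in self.data.items())

    def __add__(self, other):
        # Element-wise sum over the union of the keys.
        return Assoc(self.triples() + other.triples())

    def transpose(self):
        # Swap row and column keys.
        return Assoc((c, r, v) for r, c, v in self.triples())

    def __matmul__(self, other):
        # Sparse matrix product: match this array's column keys
        # against the other array's row keys.
        by_row = defaultdict(list)
        for (k, c), v in other.data.items():
            by_row[k].append((c, v))
        out = []
        for (r, k), v in self.data.items():
            for c, w in by_row.get(k, ()):
                out.append((r, c, v * w))
        return Assoc(out)


# Made-up example data: which documents contain which words.
E = Assoc([
    ("doc1", "word|gene", 1),
    ("doc1", "word|graph", 1),
    ("doc2", "word|graph", 1),
])

# Document-by-document array counting shared words.
shared = E @ E.transpose()
print(shared.triples())
```

In this sketch the product `E @ E.transpose()` plays the role of a composable analytic: a single algebraic expression over string-keyed data replaces an explicit join over the underlying triples.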

*********************************************************************************

Massachusetts Institute of Technology
Cambridge, MA

For more information, please visit http://math.mit.edu/crib
