Large data sets may also include multiple objects of high dimensionality, such as images, that must be analyzed based on a relatively small number of samples. The mathematical analysis of problems like these requires expertise in statistics and probability methods, which Georgia Tech School of Mathematics professor and principal investigator Vladimir Koltchinskii will contribute to the new initiative.
Once massive amounts of data are collected and processed, relevant information must be pulled from it and presented using visual and interactive means. John Stasko, a principal investigator on this project and professor in the School of Interactive Computing, conducts research in the field of visual analytics.
He heads a team that developed Jigsaw, a visual analytics system that helps analysts better assess, analyze and make sense of large document collections. The system provides multiple coordinated views to show connections between entities extracted from a document collection.
"Jigsaw essentially acts as a visual index of the document collection helping analysts identify particular documents to read and examine next," explained Stasko, whose team won the university division of the 2007 Visual Analytics Science and Technology contest using Jigsaw.
Stasko also serves as Georgia Tech's director in the Department of Homeland Security-sponsored SouthEast Regional Visualization and Analytics Center (SRVAC), a regional center created in 2006 to perform research in visual analytics. SRVAC is a partnership between the Georgia Tech and the University of North Carolina Charlotte, and is one of five national university centers connected to the National Visualization and Analytics Center located at Pacific Northwest National Laboratory.
All of the steps involved in massive data analysis and visual analytics data collection, processing, analysis and visualization
|Contact: Abby Vogel|
Georgia Institute of Technology Research News