Washington, D.C. -- Disease statistics buried within patient records or detailed in newspaper clippings can be sorted and organized to depict geographic patterns, allowing the discovery of trends that were previously overlooked, according to a Penn State geographer.
"The use of interactive maps and graphs, combined with word search interfaces, can lead to greater insight into complex events like the spread of Swine flu," said Frank Hardisty, research associate, Penn State GeoVISTA Center.
The GeoViz Toolkit is a user-friendly application that combines text mining with geographical mapping. It allows users to search publically available data to identify and visualize data patterns for their own interests or concerns.
The flexible software package allows someone with no programming experience to navigate the application, while also providing different components and analytical tools for experienced analysts.
"Potential applications range from research in public health -- infectious disease dynamics, cancer etiology, surveillance and control -- through analysis of socioeconomic and demographic data, to exploration of patterns of incidents related to terrorism or crime," said Hardisty.
Many sources for disease and crime statistics -- newspaper articles for example -- are in a semi-structured format that do not clearly present the data in a table or graph, but rather bury it within the text of the document.
To obtain high-quality, relevant information from these documents, researchers use "text analytics" or '"text mining," allowing them to retrieve only applicable information, like the date and description of a disease-related death, from the flood of information usually included in a newspaper clipping.
"An example would be searching a database of H1N1 flu reports for 'child' or 'children' and seeing if there is spatial clustering in the relative frequency of those reports," Hardisty told attendees to
|Contact: A'ndrea Elyse Messer|