Castanet: automatically generating a browsing structure for a collection
A debate seems to come up when folks in charge of organizing digital collections get together: standardized schema such as the Library of Congress Subject Headings are annoying to read, outdated, and...
View ArticleTools for Exploring Text: Visualization
In the light of my new tool to help navigate the New York Times, I’ve been reading about previous approaches to the problem of making sense of large collections of text. As far as I can tell, the...
View ArticleText mining 19th century novels with the Stanford Literature Lab
Yesterday, I attended a group meeting with the Literature Lab at Stanford University’s English Department, where they presented some very cool new results on mining 19th Century British and American...
View ArticleMetaOptimize: Q+A for the large data set community
Joseph Turian & co. at MetaOptimize have started a Q+A forum for “data geeks” – people in machine learning or data mining who deal with questions about visualizing, processing, or otherwise making...
View ArticleTools for Exploring Text: Natural Language Processing
Natural language processing (NLP), also known as computational linguistics, is a set of models and techniques for analyzing text computationally. In the context of the digital humanities, it can help...
View ArticleExtracting Social Networks from 19th Century Novels
This year’s conference of the Association for Computational Linguistics, the most prestigious event in computational linguistics, had a paper that got me very excited. It’s called Extracting Social...
View ArticleWordSeer: Exploring Language Use in Slave Narratives
More and more source text in the humanities gets digitized every day, making it accessible to large scale computational analysis. Nevertheless, traditional methods of humanistic analysis are based on...
View ArticleDigital Humanities and the Future of Search
On Tuesday, Feb. 1, I’ll be presenting my latest project WordSeer, at the Farsight 2011 conference on the future of search. This event will be streamed live from TechCrunch, the tech world’s favorite...
View ArticleEmpirical Study: Finding Examples of a Theme, by Example
A common task in literature study is to find examples of a theme. Until now, literary scholars searching for examples have had to rely on searching for sets of words they think are associated with the...
View Article