Project: Spanish Dialect Recognition
Jerid Francom is currently working with the ACTIV-ES corpus to develop an online interface to facilitate the classification and exploration of documents written in Spanish. Combining the Spanish language data and computational methods from information retrieval, the site will allow the public to upload a document of unknown origin, have that document classified as being from either Argentina, Mexico, or Spain, and then be able to visually identify those linguistic elements that are most and least indicative of the classified document. The aim is to make sophisticated computational tools available to a broad audience and facilitate identification of and bring awareness to dialect-specific and language-general patterns of the Spanish language.
Our goalThe DH Community is a program of Wake Forest's Humanities Institute. We are faculty from across campus interested in investigating the emergence of digital humanities as a field of study, and its relevance and usefulness as a research and teaching tool in the humanities.
Join the conversation!
Use your Wake Forest username and password to login and contribute to DH Talk.
Tag Cloudadministration alan liu big data close reading cloud culturomics definitions DH2014 digital pedagogy digital projects digital scholarship digitization distant reading funding hastac history humanities data curation internet language liberal arts libraries manuscripts maps media collections methods multimedia multimodal net neutrality omega organization pedagogy peer review quantitative analysis resource science spatial analysis Stanford DH statistics symposium teaching textual analysis THATCamp timelines Turing Test word frequency