Artificial Intelligence

Unsupervised Machine Learning for Categorizing and Clustering NIH Grants

Friday, May 20, 2011, 10:30am - 12:00pm PST
ISI, 11th Floor Large Conference Room
Ned Talley

The U.S. National Institutes of Health (NIH) consists of twenty-five Institutes and Centers that award ~80,000 grants each year. The Institutes have distinct missions and research priorities, but these missions and the types of research they support overlap, creating a funding landscape that can be difficult for researchers and research policy professionals to navigate. In collaboration with researchers from ISI and other organizations, we have created a publicly accessible database in which NIH grants are topic modeled using Latent Dirichlet Allocation and clustered using a force-directed algorithm that places grants as nodes in two-dimensional space, where they can be accessed in an online map-like format.
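
As a rough illustration of the force-directed placement described above, the sketch below lays out a few grants in two dimensions from their topic proportions. It is a toy under stated assumptions, not the algorithm used for the NIH database: the topic proportions are made-up stand-ins for Latent Dirichlet Allocation output, cosine similarity is an assumed choice of similarity measure, and the function names, force constants, and circular starting positions are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two topic-proportion vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def force_layout(topic_mix, iterations=300, step=0.05, repulsion=0.1, max_move=0.1):
    """Place each grant as a node in 2D with a simple spring-style model.

    Assumed force model (illustrative only): every pair of nodes attracts
    in proportion to similarity * distance and repels in proportion to
    repulsion / distance.
    """
    n = len(topic_mix)
    # Deterministic start: nodes evenly spaced on the unit circle.
    pos = [[math.cos(2 * math.pi * i / n), math.sin(2 * math.pi * i / n)]
           for i in range(n)]
    sim = [[cosine(topic_mix[i], topic_mix[j]) for j in range(n)] for i in range(n)]

    for _ in range(iterations):
        moves = []
        for i in range(n):
            fx = fy = 0.0
            for j in range(n):
                if i == j:
                    continue
                dx = pos[j][0] - pos[i][0]
                dy = pos[j][1] - pos[i][1]
                d = math.hypot(dx, dy) or 1e-9
                # Positive pull draws i toward j; negative pull pushes it away.
                pull = sim[i][j] * d - repulsion / d
                fx += pull * dx / d
                fy += pull * dy / d
            mx, my = step * fx, step * fy
            mag = math.hypot(mx, my)
            if mag > max_move:
                # Cap the per-iteration move to keep the update stable.
                mx, my = mx * max_move / mag, my * max_move / mag
            moves.append((mx, my))
        # Apply all moves together so each iteration uses consistent positions.
        for i, (mx, my) in enumerate(moves):
            pos[i][0] += mx
            pos[i][1] += my
    return pos


def dist(p, q):
    """Euclidean distance between two 2D points."""
    return math.hypot(p[0] - q[0], p[1] - q[1])


# Made-up topic proportions for three grants over three topics:
# grants 0 and 1 share a dominant topic, grant 2 does not.
topic_mix = [
    [0.90, 0.05, 0.05],
    [0.85, 0.10, 0.05],
    [0.05, 0.05, 0.90],
]
positions = force_layout(topic_mix)
print(dist(positions[0], positions[1]), dist(positions[0], positions[2]))
```

In this toy example the two grants with similar topic proportions settle closer to each other than either does to the third, which is the property that makes a map-like view of grants browsable.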

BIO: Edmund (Ned) Talley is an NIH Program Director who manages grants on synaptic transmission and related research for the National Institute of Neurological Disorders and Stroke. For the past two years he has also focused on NIH grants informatics, aiming to address unmet needs at NIH and to match those needs with burgeoning technologies in artificial intelligence, information retrieval, and information visualization.

