Japanese | English

GeoNLP Project - Toponym Information System for the Geotagging of Natural Language Text

Purpose of the Project

Putting natural language text such as news, blogs and tweets on the map is what many people want to do, because that promotes our understanding about the spatial context of the text. It is especially critical for emergency information media, because instant and automatic mapping for the stream of information from many sources allows us to respond quickly to the situation.

This mapping is relatively easy if we have machine-readable location metadata in addition to text, but usually this cannot be expected because natural language text can satisfy human-readability without structured information. Although the geocoding of a structured address became practical, geo-tagging of natural language text still remains to be a difficult task.

Hence the purpose of thie project is to make a geo-tagging system giving location metadata to natural language text using geographic information systems (GIS) and natural language processing (NLP). Moreover, to establish a ecosystem to support sustainable development of the system, we also focus on developing dictionaries of geographic named entities through collaboration with linked open data initiatives and participatory / voluntary systems, and through the development of libraries that can be used by other frameworks for web development.



January 15, 2017
Geoshape Repository was released.
March 21, 2016
GeoNLP Version 1.2.0 was released.
March 27, 2015
GeoNLP Version 1.1.0 was released.
March 7, 2014
We received best award, infrastructure division at Linked Open Data Challenge Japan 2013.
November 16, 2013
We received Prize for Encouragement at Geo Activity Fiesta.
March 12, 2012
News Analysis on 2011 Great East Japan Earthquake now allows search by placenames using GeoNLP.
November 11, 2011
Best Presentation Award received at CSIS Days 2011 for our presentation GeoNLP: Toward Intelligent Geo-Tagging for Natural Language Text.

Example of Websites using GeoNLP

Futtekitter (in Japanese)
A website for collecting and analyzing information from Twitter on something fallen from the sky, such as "snow" an "rain," and extracting placenames automatically for visualizing them on the map.
News Analysis on 2011 Great East Japan Earthquake
A website for extracting placenames from news related to Great East Japan Earthquake, and for summarizing articles by prefectures and local governments, or public facilities such as schools, to keep memories of the earthquake.
Digital Typhoon: News Topics
A website for extracting placenames from news related to typhoons, and for summarizing articles by typhoon numbers to analyze what happened by each typhoon.


Copyright 2011-2016, Asanobu KITAMOTO, National Institute of Informatics.