GeoNLP Project - Software, Data and Services for Open Toponym Information System
GeoNLP - Software for automatically geotagging natural language text (for Japanese)
GeoLOD - Service for searching and sharing toponym data
Geoshape Repository - Website for sharing data of geographic shapes
GeoNLP Toponym Web Service - Website for analyzing natural language text using gazetteers
Purpose of the Project
Putting natural language text such as news, blogs and tweets on the map is what many people want to do, because that promotes our understanding about the spatial context of the text. It is especially critical for emergency information media, because instant and automatic mapping for the stream of information from many sources allows us to respond quickly to the situation.
This mapping is relatively easy if we have machine-readable location metadata in addition to text, but usually this cannot be expected because natural language text can satisfy human-readability without structured information. Although the geocoding of a structured address became practical, geo-tagging of natural language text still remains to be a difficult task.
Hence the purpose of thie project is to make a geo-tagging system giving location metadata to natural language text using geographic information systems (GIS) and natural language processing (NLP). Moreover, to establish a ecosystem to support sustainable development of the system, we also focus on developing dictionaries of geographic named entities through collaboration with linked open data initiatives and participatory / voluntary systems, and through the development of libraries that can be used by other frameworks for web development.
Asanobu KITAMOTO, Ken T. MURATA,
"Construction of Toponym Information Platform Including Historical Administrative Region Dataset Beta and Its Application to Historical Big Data",
Technical Report of Information Processing Society of Japan (IPSJ),
2020-9 (in Japanese)
"Open Toponym Information System GeoNLP - A Tool to Analyze and Share Ambiguous Toponyms in Text",
The Journal of Survey,
2014-9 (in Japanese)
"Crisis Media Project",
Report Meeting on Big Data Workshop of Great East Japan Earthquake, 2012-10 (in Japanese)
Asanobu KITAMOTO, Takeshi SAGARA, Masatoshi ARIKAWA,
"GeoNLP: Toward Intelligent Geo-Tagging for Natural Language Text",
CSIS Days 2011,
2011-11 (in Japanese)
Please also see GeoNLP news (in Japanese) about the detail of GeoNLP project.
- July 8, 2021
GeoNLP Version 2.0 was released. And GeoLOD has started its operation.
- October 20, 2018
Technical detail of GeoNLP was added to the Japanese version.
- March 20, 2018
GeoNLP Data and GeoNLP Service was updated to solve troubles.
- January 15, 2017
Geoshape Repository was released.
- March 21, 2016
GeoNLP Version 1.2.0 was released.
- March 27, 2015
GeoNLP Version 1.1.0 was released.
- March 7, 2014
We received best award, infrastructure division at Linked Open Data Challenge Japan 2013.
- November 16, 2013
We received Prize for Encouragement at Geo Activity Fiesta.
- March 12, 2012
News Analysis on 2011 Great East Japan Earthquake now allows search by placenames using GeoNLP.
- November 11, 2011
Best Presentation Award received at CSIS Days 2011 for our presentation GeoNLP: Toward Intelligent Geo-Tagging for Natural Language Text.
Example of Websites using GeoNLP
- Futtekitter (in Japanese)
- A website for collecting and analyzing information from Twitter on something fallen from the sky, such as "snow" an "rain," and extracting placenames automatically for visualizing them on the map.
- News Analysis on 2011 Great East Japan Earthquake
- A website for extracting placenames from news related to Great East Japan Earthquake, and for summarizing articles by prefectures and local governments, or public facilities such as schools, to keep memories of the earthquake.
- Digital Typhoon: News Topics
- A website for extracting placenames from news related to typhoons, and for summarizing articles by typhoon numbers to analyze what happened by each typhoon.
The project is supported by the following.
The project was also supported by the following.
Copyright 2011-2021, Asanobu KITAMOTO, National Institute of Informatics.