1. Dataset

This page releases datasets and software of the Digital Typhoon project. The datasets are useful not only for machine learning but also for the quantitative analysis of meteorological studies.

Dataset

The dataset is a typhoon-centered image dataset created from hourly meteorological satellite infrared channel images. Data from successive generations of Himawari weather satellite images since 1978 are converted to brightness temperatures, and calibrated for different satellite sensor observations, resulting in a uniform spatio-temporal dataset spanning more than 40 years.

Version Target Release
Version 1 Nortdern Hemisphere November 3, 2023
Version 2 Nortdern / Soutdern Hemispheres November 26, 2024

Paper

  1. Version 2: Asanobu Kitamoto, Erwan Dzik, Gaspar Faure, "Machine Learning for the Digital Typhoon Dataset: Extensions to Multiple Basins and New Developments in Representations and Tasks", arXiv:2411.16421, 2024.
  2. Version 1: Asanobu Kitamoto, Jared Hwang, Bastien Vuillod, Lucas Gautier, Yingtao Tian, Tarin Clanuwat, "Digital Typhoon: Long-term Satellite Image Dataset for the Spatio-Temporal Modeling of Tropical Cyclones", NeurIPS 2023 Datasets and Benchmarks (Spotlight), 2023 (Digital Typhoon: Long-term Satellite Image Dataset for the Spatio-Temporal Modeling of Tropical Cyclones, arXiv:2311.02665.).

Please browse the list of publications to find other references on typhoons (filter by 'typhoon').

Software

Pyphoon is a Python library for machine learning research using the digital typhoon datasets.

Data Repository

Digital Typhoon Dataset is also available from DIAS (Data Integration and Analysis System). DIAS is a Japanese data repository for earth science and environmental datasets and can assign the dataset DOI (Digital Object Identifier) (doi:10.20783/DIAS.664) as a persistent identifier.

  1. Digital Typhoon Dataset

Acknowledgment

About the meteorological satellite Himawari imagery, most of them were purchased from Japan Meteorological Business Support Center, but some of them were received at Institute of Industrial Science, University of Tokyo. Please see the details at Sources of Various Data.

About the meteorological satellite Himawari imagery, most of them were purchased from Japan Meteorological Business Support Center, but some of them were received at Institute of Industrial Science, University of Tokyo. Please see the details at Sources of Various Data.

Many people have contributed to the development of the Digital Typhoon dataset. In particular, the following internship students of Kitamoto laboratory has been involved in the research of machine learning algorithms and the development of software libraries for the dataset.

  1. Danlan Chen
  2. Lucas Rodes Guirao
  3. Alexander Grishin
  4. Clément Playout
  5. Izabela Horvath
  6. Jean-Paul Lam
  7. Jared Hwang
  8. Bastien Vuillod
  9. Lucas Limos Gautier
  10. Gaspar Faure
  11. Erwan Dzik
  12. Tong Ngoc Anh
  13. Veleva, Anja