Published April 28, 2025 | Version v1
Dataset Open

Rainfall Prediction: Comparison of 7 Popular Models

  • 1. ROR icon TU Wien

Description

Rainfall Prediction using 7 Popular Models

Context and Methodology

Research Domain/Project:

This dataset is part of a machine learning project focused on predicting rainfall, a critical task for sectors like agriculture, water resource management, and disaster prevention. The project employs machine learning algorithms to forecast rainfall occurrences based on historical weather data, including features like temperature, humidity, and pressure.

Purpose:

The primary goal of the dataset is to train multiple machine learning models to predict rainfall and compare their performances. The insights gained will help identify the most accurate models for real-world predictions of rainfall events.

Creation Process:

The dataset is derived from various historical weather observations, including temperature, humidity, wind speed, and pressure, collected by weather stations across Australia. These observations are used as inputs for training machine learning models. The dataset is publicly available on platforms like Kaggle and is often used in competitions and research to advance predictive analytics in meteorology.

Technical Details


Dataset Structure:

The dataset consists of weather data from multiple Australian weather stations, spanning various time periods. Key features include:

Temperature
Humidity
Wind Speed
Pressure
Rainfall (target variable)
These features are tracked for each weather station over different times, with the goal of predicting rainfall.

Software Requirements:

Python: The primary programming language for data analysis and machine learning.
scikit-learn: For implementing machine learning models.
XGBoost, LightGBM, and CatBoost: Popular libraries for building more advanced ensemble models.
Matplotlib/Seaborn: For data visualization.
These libraries and tools help in data manipulation, modeling, evaluation, and visualization of results.
DBRepo Authorization: Required to access datasets via the DBRepo API for dataset retrieval.

Additional Resources

Model Comparison Charts: The project includes output charts comparing the performance of seven popular machine learning models.
Trained Models (.pkl files): Pre-trained models are saved as .pkl files for reuse without retraining.
Documentation and Code: A Jupyter notebook guides through the process of data analysis, model training, and evaluation.

Files

10-Model_Comparison_ROC_Cohens.png

Files (196.1 MiB)

Name Size
md5:897c561018b03288b6181eebaa1357f8
1.0 KiB Download
md5:fe36172214a18ff18893fd407f360b9d
17.5 KiB Preview Download
md5:e13f783e0cd9be3d8548e80572562689
31.0 KiB Preview Download
md5:4e334bb1dfc4ace42a404a15909144d9
57.3 KiB Preview Download
md5:4296542d1fc792e7370aab24ece1dba8
1.4 MiB Download
md5:a19ae293a472c140fe0d0ae69baaf88e
18.0 KiB Preview Download
md5:ce64ae87e8c375c0254a7b89f12fd500
29.4 KiB Preview Download
md5:3b8f0072affb2b12bb9b10b39ca864f7
47.9 KiB Download
md5:e71f31ca7aa7d5bf7080cb30a124a2a5
17.7 KiB Preview Download
md5:4c56f3dcabce95b4de09672c3705012f
29.0 KiB Preview Download
md5:554a35a7f19aebbcecce6e74ea2d091e
125.2 MiB Download
md5:d51dab3e5ac7248ed8a6dc4e49af7905
17.9 KiB Preview Download
md5:b2e5303578018039c472d191c0d29e76
29.1 KiB Preview Download
md5:597f74be9d1bedd0f61caf8b87367f7f
1.1 MiB Download
md5:07ce72da1d75f1ca9e65e40a8e4e9cc0
18.2 KiB Preview Download
md5:32753e1fcf090e76bb206bd377cfd7ce
29.9 KiB Preview Download
md5:6879e6e235e2e5c0d3897346ce49fb6d
50.0 MiB Download
md5:71a79ac225178f7d8ef43f1c22c88c48
17.7 KiB Preview Download
md5:b623aefb19b22fc1d112ce71e65dd93a
28.9 KiB Preview Download
md5:436403a7fc5b70397b14e6de6f8744eb
14.9 MiB Download
md5:bbe433ee8c7d9b8f814d971db7ad946f
19.1 KiB Preview Download
md5:77080ad7078eccc70b7c2060ecf4707b
28.2 KiB Preview Download
md5:bac5cff0c478dca7668ec3d16a645c35
1.0 MiB Preview Download
md5:d2a3eed558c3120522360e415342ad81
57.3 KiB Preview Download
md5:1b5518bba30a27b8a15c4c9fb994d05b
2.0 MiB Preview Download
md5:b0178d54bbbe8a869315d030abed3e79
1.4 KiB Preview Download

Additional details

Dates

Submitted
2025-04-28