Team Submission for the 2024 TUMai Hackathon

Challenge Sponsor: REPLY

Challenge: Develop predictive maintenance solutions to optimize energy production and prevent unexpected downtimes in wind turbines.

Authors: Friedrich Siemers, Phillip Wondra, Benedikt Kellner, Maximilian Seeger

Files and what they do

hackathon_EDA and umap.ipynb

multivariate and non-linear feature inspection based on pairplots and UMAP. Contains some nice vizualisations

hackaton_generate_labels.ipynp

generation of target labels: generates tuples of turbine and date where one of the selected errors happens

hackaton_ML_labeloneday.ipynb

Transformation of problem into multi-day-ahead classification problem containing feature creation, selection, coarse graining, and application of ML models. Contains a train set evaluation

hackaton_feature_ueberpruefunhg.ipynb

code for univariate feature inspection

Data used

First the provided data from REPLY was analyzed. Based on this analysis we switched to a larger dataset which can be found here: https://zenodo.org/records/5841834

Introduction

The challenge posed by REPLY was to leverage wind turbine system data to develop predictive maintenance strategies that would enhance energy efficiency and operational reliability. Our goal was to minimize downtime, maximize power output, and extend the lifespan of wind turbines using machine learning techniques.

Approach and Methodology

Data Review and Selection

Initial data provided by REPLY lacked failure signals; only weather data was available which was insufficient for predictive analysis.
Sourced additional datasets from Penmanshiel and Kelmarsh Wind Farms in England, containing over 400 data points including failures and SCADA sensor data.
Selected Kelmarsh Wind Farm data for initial focus due to its comprehensive data on failures and sensor readings.

Step-by-Step Process

Understanding Failures
- Evaluated failures based on their frequency, predictability, and preventability.
- Identified economic impacts and assessed if maintenance could prevent damages.
- Focused on frequency converter errors due to time constraints, aiming to predict and prevent this specific failure.
Feature Reduction
- Analyzed datasets to identify where less than 10% of the data were NaN values.
- Performed technical reduction to determine 48 relevant features that would be crucial for the machine learning model.
Error Data Analysis
- Quantified the occurrence of frequency converter errors and the associated downtime.
Feature Data Analysis
- Conducted visual inspections comparing damage periods with random periods to detect patterns and correlations.
- Repeated this analysis for different time frames (daily, weekly, multi-weekly) to ensure robustness.
Visualization and Dimension Reduction with Umap
- Explored how error labels distributed across different conditions.
- Detected clusters, although noted redundancy due to multiple counts of events per day.
Data Cleaning
- Consolidated multiple error messages from the same day into a single record to refine the dataset.
Insufficient Data for Reliable Predictions
- Post-cleaning, it was evident that the dataset contained limited failure data, with only 14 points available for predictive modeling.
Time Series Analysis
- Developed time series data in intervals of 24, 48, and 72 hours before damage.
- Created aggregated features such as mean and standard deviation within these windows to enhance model accuracy.
Further Feature Reduction and Predictive Modeling
- Applied feature importance metrics and techniques like ANOVA and recursive feature elimination with logistic regression to minimize overfitting.
- Generated a logistic regression model to predict future frequency converter errors.
Model Testing
- Applied the developed model to the Penmanshiel dataset for validation, which did not yield successful predictions due to data limitations.

Challenges Encountered

Data Limitations: The scant amount of failure data restricted the statistical relevance and the training robustness of our predictive model.
Short Data Span: Limited error data points undermined the ability to train a model capable of generalizing across different operational scenarios.

Conclusion and Recommendations

Despite the initial setbacks due to data insufficiency, our team was able to develop a methodology to identify and predict critical failures in wind turbine operations, specifically targeting frequency converter errors. For future work, acquiring more comprehensive datasets with a broader range of failure instances is crucial to refining the predictive models. Our efforts have laid the groundwork for developing predictive maintenance strategies that can significantly reduce downtime and enhance the efficiency of wind turbines.

Future Directions

Expand data collection efforts to include a wider array of error types and operational conditions.
Explore more advanced machine learning algorithms that can handle sparse data more effectively.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.DS_Store		.DS_Store
.gitignore		.gitignore
3d_umap_visualization.html		3d_umap_visualization.html
Don Quixote.png		Don Quixote.png
PräsentationProjektHackaton.pdf		PräsentationProjektHackaton.pdf
PräsentationProjektHackaton.pptx		PräsentationProjektHackaton.pptx
Team Submission for the 2024 TUMai Hackathon.pdf		Team Submission for the 2024 TUMai Hackathon.pdf
hackathon_EDA and umap.ipynb		hackathon_EDA and umap.ipynb
hackaton_ML_labeloneday.ipynb		hackaton_ML_labeloneday.ipynb
hackaton_feature_ueberpruefunhg.ipynb		hackaton_feature_ueberpruefunhg.ipynb
hackaton_generate_labels.ipynb		hackaton_generate_labels.ipynb
readme.md		readme.md
umap_projektion_mit_fehlern.png		umap_projektion_mit_fehlern.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Team Submission for the 2024 TUMai Hackathon

Files and what they do

Data used

Introduction

Approach and Methodology

Data Review and Selection

Step-by-Step Process

Challenges Encountered

Conclusion and Recommendations

Future Directions

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Team Submission for the 2024 TUMai Hackathon

Files and what they do

Data used

Introduction

Approach and Methodology

Data Review and Selection

Step-by-Step Process

Challenges Encountered

Conclusion and Recommendations

Future Directions

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages