Inspiration

The prevalence and impact of breast cancer on individuals and families is immense. Diseases like cancer have higher rates of fatality but some cancers like that of the breast, if detected early in the patient, are treatable. With the understanding that early detection and tailored treatments can significantly improve outcomes, I wanted to harness the power of data to uncover valuable insights that could support medical professionals and researchers in their efforts to fight this disease. Through data analysis, I aim to contribute to a greater understanding of breast cancer risk factors, survival patterns, and treatment implications.

What it does

This project analyzes a breast cancer dataset to identify key trends and insights that could benefit clinical research and treatment planning. Through exploratory data analysis, it examines factors like tumor size, hormone receptor status, cancer stages, and survival rates to reveal patterns that could influence early detection strategies and personalized treatments. By uncovering correlations and visualizing survival trends, the project aims to support a deeper understanding of risk factors and outcomes, providing a data-driven foundation for potential medical interventions and patient care improvements.

How I built it

I built this project using Python, using libraries like pandas for data manipulation, matplotlib and seaborn for visualization. Firstly, I checked if data is complete and accurate. Then, I performed exploratory data analysis (EDA) to understand key statistics, correlations, and patterns within the data. Finally, I was able to identify insights into factors affecting survival rates and cancer progression.

Challenges I ran into

To build this project, I had to learn about libraries like matplotlib and seaborn since I had not used them before. I also had to study how to implement, comprehend and use correlations in data.

Accomplishments that I'm proud of

I managed to build a comprehensive project that has the potential to determine key insights from a dataset. The insights determined from the analysis can drive clinical decisions. This project can also be used to assist doctors with educating patients on how common breast cancer is and what factors put patients at risk.

What I learned

This was the first time I was dealing with a dataset which pertains to breast cancer. Since I am not studying medicine, I was unaware of several data attributes and what they meant. Through this project, I managed to understand certain medical terms. Also, I managed to learn two new libraries - matplotlib and seaborn.

What's next for Breast Cancer Data Analysis

This project can be used to build a machine learning based model. The insights derived from this project can be used to predict the likelihood of breast cancer in patients.

Built With

Share this project:

Updates