Inspiration

Our inspiration came from different machine learning tutorials, thus each of us started with different approaches and methods, but later combined and tested our algorithms together to find out the best one with the highest accuracy

What it does

We finally agreed that the Random forest classification algorithm has the best performance of all algorithms. With the highest learning accuracy and test accuracy, RFC proved its potential in learning and analyzing data with the correct data clustering, thus we focused on the RFC algorithm and made it our final submission

How we built it

We built the RFC model with the following steps

  1. split the dataset and pre-processing data
  2. build the model
  3. train the model
  4. evaluate the model with metrics
  5. tune the hyperparameters

Challenges we ran into

  1. choose the best algorithm base on performance
  2. analyze data and select the most important columns to feed the algorithm
  3. improve the performance of the RFC model

Accomplishments that we're proud of

  1. we tried a variety of algorithms at the beginning stage
  2. the RFC model is promising and has the highest accuracy
  3. our way of analyzing input data is efficient

What we learned

  1. how to split and cluster original data
  2. more ML algorithms and their coding technique
  3. how to test and improve ML algorithms

What's next for the AI dataset

there should be a better way to clean the data, so if we have more time, we can try to make the input data cleaner to achieve higher accuracy

Built With

Share this project:

Updates

posted an update

Inspiration

Our inspiration came from different machine learning tutorials, thus each of us started with different approaches and methods, but later combined and tested our algorithms together to find out the best one with the highest accuracy

What it does

We finally agreed that the Random forest classification algorithm has the best performance of all algorithms. With the highest learning accuracy and test accuracy, RFC proved its potential in learning and analyzing data with the correct data clustering, thus we focused on the RFC algorithm and made it our final submission

How we built it

We built the RFC model with the following steps

  1. split the dataset and pre-processing data
  2. build the model
  3. train the model
  4. evaluate the model with metrics
  5. tune the hyperparameters

Challenges we ran into

  1. choose the best algorithm base on performance
  2. analyze data and select the most important columns to feed the algorithm
  3. improve the performance of the RFC model

Accomplishments that we're proud of

  1. we tried a variety of algorithms at the beginning stage
  2. the RFC model is promising and has the highest accuracy
  3. our way of analyzing input data is efficient

What we learned

  1. how to split and cluster original data
  2. more ML algorithms and their coding technique
  3. how to test and improve ML algorithms

What's next for the AI dataset

there should be a better way to clean the data, so if we have more time, we can try to make the input data cleaner to achieve higher accuracy

Log in or sign up for Devpost to join the conversation.