Inspiration
We saw a cool visualization of Venmo paper-trail data and decided to use Capital One's Nessie API to simulate customer behavior and purchasing decisions with hierarchical Bayesian modeling.
What it does
Simulates and generates data about customers and which types of products they purchase. Analyzes data and uses LDA clustering to rediscover the underlying connections between different merchants.
How we built it
We used Capital One's Nessie API in Python to write a variety of functions for adding and measuring purchases between customers and merchants. Using the random module in num.py, we generated purchase data. We then used the gensim package to fit the data. Finally, we used d3.js in Javascript to create visualizations of the purchase data.
Challenges we ran into
We had no data, so we had to figure out a reasonable model for simulating interesting and realistic data. Improving runtime when processing large amounts of data was also a challenge.
Accomplishments that we're proud of
Developing functions for generating random names of people and of stores Creating realistic results that measure the connectedness between merchants
What we learned
We gained a better understanding of scripting in Python and writing modules between a variety of different data types. We also learned a lot about interactive data visualization in Javascript.
What's next for Nessie Vis
We'd love to apply our model to a real data set and showcase some discoveries about real-world customers and merchants!
Built With
- d3.js
- gensim
- javascript
- nessie
- python

Log in or sign up for Devpost to join the conversation.