Skip to content
View shubhamshukla1177's full-sized avatar

Block or report shubhamshukla1177

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
shubhamshukla1177/README.md

Hi there πŸ‘‹

  • πŸ‘¨β€πŸ’» I'm a Data Engineer with 6+ years of experience, currently working in a fintech industry.
  • ✍ I've done my Masters in Computer Science and Machine Learning from George Mason University, Virginia.
  • πŸ‚ I'm currently building a scalable and robust multi-cloud data infrastructure on healthcare dataset.
  • πŸ” I've a strong interest in Natural Language Processing (NLP), I've built a BERT model for sentence classification. And I've also worked on a research paper to implement a cost-effective solution for Natural Language Understanding of long texts that can utilize pre-trained encoder-decoder models designed for short texts.
  • πŸ”§ As a believer in the DevOps ethos, I include agile DevOps methods into my data engineering workflows and I am still learning the best practices.
  • 🎯 My goal is to contribute to open-source projects and become a data architect.
  • πŸ“« You can reach out to me at shubhamshukla1177@gmail.com or https://www.linkedin.com/in/kumar-shubham-0945a1132/

Pinned Loading

  1. Data-Mining Data-Mining Public

    This repo contains source codes of data mining projects such as "Sentiment Analysis" on different datasets using Logistic regression,KNN

    Jupyter Notebook 1

  2. InformationSecurity InformationSecurity Public

    Contains some lab work of Public key encryption, Man-In-The-Middle Attack, RSA Algorithm, Transport Layer Security.

    C 1

  3. PySpark-In-Databricks PySpark-In-Databricks Public

    Scala 1

  4. CS657-Mining-Massive-Datasets-using-PySpark CS657-Mining-Massive-Datasets-using-PySpark Public

    This repo contains some assignments of the course CS-657 Mining massive dataset, taken in George Mason University under Prof. Daniel Barbara.

    Jupyter Notebook

  5. CS678_Advanced_NLP CS678_Advanced_NLP Public

    Python

  6. DatabaseSystems DatabaseSystems Public

    Database management project