- Autism is well-known neurodevelopment disorder commonly found in adults and children which is characterized by inflexible behaviors, interests and can affect one's social life. Our project aimed to simulated a machine learning algorithm to analyze a specific DNA sequence from Autism Spectrum Disorder and to allow the user to answer a questionnaire to determine if they likely to have autism using questions and their DNA sequence. While there are various DNA sequences involved with Autism. Our project focused on using Gene ID: NM_030627.4. The gene NM_030627.4 is known for coding the protein which is commonly known as **CPEB4 ** and medically referred to as Idiopathic Autism (which is 80% of autistic patients), this protein leads to less neuronal activity, social awkwardness, sensitive to loud noises, and various symptoms that are well-recognized by medical professionals. While many people are aware of Autism's effects on disability due to famous shows such as the 'Good Doctor', what many people don't realize is that autism is on the rise as supported by the (CDC) Center of Disease Control Research. A study that was published in the JAMA Network Open found that children and adults diagnosed with autism spectrum disorder (ASD) increased by 175% over a decade and there could be various reasons such environmental factors like pollution, lack of healthcare, rise of media, The problem of Autism is that its hard to identify and requires time and observational techniques. What's worse is that it is affecting many younger generations like children whose learning abilities shouldn't be hindered by Autism's effects.
- Total of 111 Proteins Detected
- Largest Proteins were stored and saved for further analysis
- High Lucine and Serine amino acids Detected, Low Glutamine (which is a key insight since its commonly low in autism)
- Moderate Stable Sequence in terms of GC %
- Low GC content (possible link to neurodegenerative abilities with neurons in the brain)
- Data: found in AutismProteinAnalysis.ipynb
- Article on Increased Glutamate & Homocytesine and Decreased Glutamine Levels in Autism: Strategies for future study of amino acids.

Autism Embedded with Deep Learning
- IEEE Research Paper (Autism diagnosis with a CNN model based on genetic mutations): https://ieeexplore.ieee.org/document/10638956, Published on IEEE on 23 August 2024
- IEEE Research Paper (Comprehensive Analysis of Autism Spectrum Disorder Detection Using Deep Learning) :https://ieeexplore.ieee.org/abstract/document/10763358, Published on IEEE on 3 December 2024
Medical Autism Research
- National Library of Medicine PubMed Central (Genetic contributions to autism spectrum disorder): https://pmc.ncbi.nlm.nih.gov/articles/PMC8477228/
- National Library of Medicine PubMed Central (Neuroactive Amino Acid Profile in Autism Spectrum Disorder: Results from a Clinical Sample): https://pmc.ncbi.nlm.nih.gov/articles/PMC9955282/
- National Library of Medicine PubMed Central (Amino Acid Patterns in Children with Autistic Spectrum Disorder: A Preliminary Biochemical Evaluation): https://pmc.ncbi.nlm.nih.gov/articles/PMC11767892/
- The Environmental and Genetic Causes of Autism Book by James Lyons-Weiler PhD, Richard E. Frye: https://www.audible.com/pd/The-Environmental-and-Genetic-Causes-of-Autism-Audiobook/B072KT5GT9?source_code=MSNPP30DTRIAL55003142490W1&gclid=2a5250ca70e51a7cabfc372a1f0c8697&gclsrc=3p.ds&msclkid=2a5250ca70e51a7cabfc372a1f0c8697, Provided by Audible in Amazon
- Arizona Autism United Article (How the Autism Diagnosis Has Evolved Over Time): https://azaunited.org/blog/how-the-autism-diagnosis-has-evolved-over-time?msclkid=208827c8134714efb89740287cae9839&utm_source=bing&utm_medium=cpc&utm_campaign=AZA%20United-%20Blog&utm_term=autism%20history&utm_content=History%20Of%20Autism
- Found in our app.py
- 10 Questions about possible signs
- Allows user to input DNA sequences from DNA tests and compare it with the CPEB4 Protein later on.
- Problem Statement: How can anlayze CPEB4 Protein structure of Autism and compare it from DNA sequences provided by the user using machine learning modeling
- Target Market: Autistic or Potentially autistic Patients (questionnaire), Biological Scientist (Bioinformatic Analysis)
- Competitor Analyst: - Competition: 23andMe, AncestryDNA, Genomic Health, AlphaFold, DeepMind - Advantages: Well-established Datasets, direct-to-consumer models. Accessibility, Advanced Searching, AI-drive protein structure prediction - Disadvantages: Limited Protein structure analysis, modeling accuracy
- Unique Value Proposition: User interaction, bridging gap between genetic testing, structural biology, and software development. Pattern recognition in ASD biomarkers
- Monetization: Subscriptions to premium machine learning models to refine system, collaborations with Biotech, healthcare providers, and university research
- Go-to-Market Approach: Government-funded, establishment of enterprise security for software such as firewalls, edge routers, and IPS once application grows with data. Non-profit research funds and donations to advance machine learning modeling.
- User data is translated to proteins to match length of the longest protein, it uses an empty _ list to represent a stop codon
- Using Hamming distance technique, it counts each character for similarity and compares

- Improve Block Coding & Analytical Accuracy
- More Analytical tools to modulate Autism databases from adults and children
- Fixing debugs and connect networks of data so they function together with no lag or issues
- Software Enhancements for working application and can increase awareness of Autism effects.
- Embedded system regulate speed of code and computational errors

- Estimated Cost (Year 1):
- Product Development: $150,000
- Software Engineering (AI & ML Development): $80,000
- Cloud Infrastructure & Data Storage: $30,000
- Genetic & Proteomic Database Licensing: $20,000
- Security & Compliance:$20,000
- Operations & Administrative: $100,000
- Office Space & Utilities: $30,000
- Legal & Regulatory Compliance: $20,000
- HR & Salaries (Support Staff): $50,000
- Marketing & Customer Acquisition: $75,000
- Digital Marketing & SEO: $30,000
- Conferences & Partnerships: $25,000
- Customer Support & Outreach: $20,000
- Research & Development: $125,000
- Academic Collaborations: $50,000
- Pilot Testing & Validation: $50,000
- Data Collection & AI Training: $25,000
- Total Estimated Budget: $450,000
- Our software product entails to follow User and Patient Privacy in compliance of HIPAA and Privacy Rules, as demostrated below, we do want the user to know they are entitled to their own rights.
- https://www.hhs.gov/hipaa/for-individuals/guidance-materials-for-consumers/index.html
- Our Product ensures through enterprise security as authentication, Edge Routers, firewalls, and IPS that our system will be secured and protected by a secuirty and compliances team working together to make sure our program runs smoothly and safely.
- Your skills and donations along with software development and machine learning can help communities find cures, treatments, and potentially solutions to dealing with diseases like Autism.
- Autism Speaks: https://www.autismspeaks.org/?form=donate&utm_medium=paidsearch&utm_source=bing&utm_campaign=IS-branded-conv&utm_content=IS-evergreen&msclkid=3bcf11fea5921da2859f761ecfcea87d
- American Parkinson Disease Association: https://www.bing.com/search?q=Autism%20research%20donationa%20nd%20support&qs=n&form=QBRE&sp=-1&ghc=1&lq=0&pq=autism%20research%20donationa%20nd%20suppor&sc=12-35&sk=&cvid=C89F6A9ACD62418790AEB40676BF436F&ghsh=0&ghacc=0&ghpl=
- Swiss Stem Cells Treatment Clinic: https://swissmedica.startstemcells.com/autism1/usa?msclkid=434ab421a9b81fe8fd1bd3c1c3e4a24b&utm_source=bing&utm_medium=cpc&utm_campaign=Search_Autism_USA_EN&utm_term=autism%20support%20services&utm_content=EN%20autism
- Autism Research Institute: https://autism.org/donate-giving/






