Risk Modeling & Decision Safety
Underwriting and fraud-risk workflows with calibration, threshold policies, abstention, validation, explainability, and review-focused reporting.
Data Scientist · Machine Learning · Decision Intelligence
I’m Amir Honardoust, a Data Scientist focused on explainable machine learning, forecasting, NLP, analytics, and practical AI systems people can understand and use.
What I do
My work connects statistical thinking, machine learning, product sense, and clear communication. I care about models that are evaluated, explainable, and useful beyond a notebook.
Underwriting and fraud-risk workflows with calibration, threshold policies, abstention, validation, explainability, and review-focused reporting.
Retrieval pipelines, knowledge-graph augmentation, text classification, recommender evaluation, and AI systems built for traceability.
Synthetic tabular-data evaluation, business prediction tools, reproducible model workflows, dashboard outputs, and portfolio-grade documentation.
Featured projects
Risk ML · Calibration · Abstention
GitHub ↗Built a decision-safety workflow for underwriting-style model review with corrected calibration, abstention policies, data validation, policy variants, slice safety reporting, and CI.
Fraud Risk · Explainability
GitHub ↗Developed a fraud-risk workflow with validation, cost-sensitive threshold search, policy artifacts, reason codes, SHAP explainability, and a Streamlit review dashboard.
RAG · Knowledge Graphs
GitHub ↗Built a retrieval-augmented question-answering system that combines vector retrieval, graph expansion, citation-aware context, optional LLM answers, FastAPI, and Streamlit.
Synthetic Data · Evaluation
GitHub ↗Compared synthetic tabular data generators using realism, distribution overlap, correlation preservation, privacy proxies, utility metrics, and visual diagnostics.
Recommendations · Evaluation
GitHub ↗Built a recommender-system demo with content-based filtering, corrected SVD scoring, hybrid blending, baseline comparisons, alpha sweep, tests, and structured outputs.
NLP · Classification
GitHub ↗Built an NLP classification pipeline for detecting unreliable news text using preprocessing, vectorization, model comparison, evaluation reporting, and clean documentation.
Business ML · Forecasting
GitHub ↗Created a business-focused profit prediction project with validation, baseline comparison, model selection, candidate scoring, risk notes, tests, CI, and clear output artifacts.
Technical notes, project breakdowns, reproducible workflows, and deeper implementation details.
Visit technical labAbout
I focus on practical data science: understanding the problem, shaping the data, building the right model, evaluating it honestly, and communicating the result clearly.
My strongest interests are risk modeling, retrieval-augmented generation, synthetic data evaluation, recommender systems, explainability, and analytics systems that help people make better decisions.
“Good data science is not just a model. It is a reliable path from messy evidence to a decision someone can trust.”
Skills
Python, SQL, pandas, NumPy, statistics, exploratory analysis, feature engineering.
Classification, regression, forecasting, validation, metrics, error analysis, interpretation.
Text classification, transformers, RAG, semantic search, embeddings, grounded AI systems.
Plotly, Streamlit, dashboards, storytelling, KPI reporting, decision-support interfaces.
Git, APIs, FastAPI, reproducible pipelines, documentation, clean project structure.
Translating models, uncertainty, tradeoffs, and results for technical and business audiences.
Contact
The fastest way to reach me is through LinkedIn or GitHub. For technical details, visit my lab at honardoust.codes.