BlasBenito/README.md

Blas M. Benito, PhD

Senior Data Scientist & Team Lead

Email • Website • LinkedIn • Google Scholar • ORCID

About Me

I'm a data science leader with 19+ years of international experience bridging cutting-edge research and production systems. I specialize in translating complex geospatial and machine learning methods into scalable solutions that drive measurable business impact.

📄 Download my full résumé (PDF)

Current Role

Senior Data Scientist & Team Lead at Biome Makers Inc (Spain/US)

  • Built and lead a distributed 4-person data science team with established code quality standards, review procedures, and strategic planning alignment
  • Architected production systems combining high-resolution Earth observation data and machine learning, mapping 200,000+ hectares of soil microbiome and crop disease worldwide
  • Delivered solutions that secured €2M+ in enterprise contracts, directly shaping company strategy and enabling commercial expansion
  • Established operational stability procedures maintaining production system reliability across global deployments

What I Bring

| Area | Expertise |
|---|---|
| Leadership | Building high-performing teams • Strategic planning • Cross-functional collaboration • Mentorship |
| Geospatial | PostGIS • GDAL/OGR • COG/Zarr/STAC • Earth Observation (Sentinel-1/2, Landsat, SAR) |
| ML & Analytics | Spatial ML • Random Forest • XGBoost • Time series analysis • Predictive modeling |
| Engineering | R • Python • SQL • C++ • CI/CD • Jenkins • Docker • AWS/GCP |
| Data Systems | Pipeline automation • High-performance computing • Large-scale data processing |

Career Highlights

Research & Software Development • University of Alicante & Bergen University (2016–2021)

  • Led €199K research project funded by the Norwegian Research Council
  • Published 21 peer-reviewed papers including Most Downloaded Paper and Editor's Pick recognitions
  • Developed open-source tools with 100,000+ combined downloads

Early Career • Spain & Denmark (2006–2016)

  • Secured €850K in competitive grants and government contracts
  • Led research teams developing large-scale geospatial modeling pipelines
  • Patented MODELER, a metadata system for Earth Sciences
  • Authored 26 peer-reviewed publications

Education

  • 🎓 PhD in Computational Ecology, University of Granada (2010)
  • 🌍 MSc in Geographic Information Systems (UNIGIS), University of Girona (2009)
  • 📊 MSc in Management and Environmental Auditing, University of Cádiz (2006)
  • 🧬 BSc in Biology (Ecology), University of Granada (2003)

Open Source Contributions

R packages I've authored and maintain:

| Package | Description |
|---|---|
| distantia | Fast Dynamic Time Warping for time series comparison |
| spatialRF | Spatial modeling with Random Forest |
| collinear | Automated multicollinearity management |
| memoria | Ecological memory analysis in time series |
| virtualPollen | Mechanistic simulation in plant ecology |

Let's Connect

I'm always interested in discussing data science, geospatial technology, and opportunities where I can make an impact.

Email • LinkedIn • Mastodon • BlueSky

Popular repositories

  1. spatialRF (Public)

    R package to fit spatial models with Random Forest

    R • 117 stars • 18 forks

  2. distantia (Public)

    R package to compute dissimilarity between multivariate time series

    R • 24 stars • 6 forks

  3. collinear (Public)

    R package to manage multicollinearity in modeling data frames

    R • 16 stars • 1 fork

  4. SDMcourse (Public)

    Species Distribution Modelling course

    R • 14 stars • 9 forks

  5. virtualPollen (Public)

    R package to simulate pollen curves based on virtual taxa with different life and niche traits

    R • 5 stars • 1 fork

  6. spatialRF_talk (Public)

    Talk about incorporating spatial autocorrelation into random forest models

    JavaScript • 5 stars • 1 fork