Blas M. Benito

Blas M. Benito

Senior Data Scientist & Team Lead

AgTech

Bio

I’m a senior data scientist & team lead with 19+ years of international experience bridging cutting-edge research and production systems. I build geospatial and machine learning pipelines into scalable solutions to solve hard problems and drive business impact.

Current Role

I lead data science teams building production systems at the intersection of geospatial technology and machine learning. My recent work has focused on combining high-resolution Earth observation data with predictive modeling to map soil and crop health at scale, delivering solutions that have secured €2M+ in enterprise contracts and mapped over 200,000 hectares worldwide.

Open Source

I develop and maintain R packages for spatial and temporal analysis: distantia (dynamic time warping), spatialRF (spatial machine learning), collinear (multicollinearity management), memoria (ecological memory), and virtualPollen (ecological simulation). Together, these packages have been downloaded over 100,000 times.

Research Background

Before transitioning to industry, I built my expertise in world-class research labs across Spain, Denmark, and Norway. I’ve co-authored 49 peer-reviewed papers with 210 collaborators from 22 countries, with several recognized as “Most Downloaded” and “Editor’s Pick” in their respective journals.

Beyond Work

When I’m not working, I enjoy time with my family, tinkering on the piano, paddleboarding, and developing R packages.

Let’s Connect

I’m always open to discussing data science leadership, geospatial technology, and new opportunities. Connect with me on LinkedIn or drop me an email.

Interests

  • Geospatial Data Science & Engineering
  • Machine Learning & Predictive Modeling
  • Earth Observation & Remote Sensing
  • Production Systems & Pipeline Automation
  • Scientific Software Development
  • Team Leadership & Strategic Planning

Education

  • Ph.D. in Computational Ecology, 2010

    University of Granada

  • MSc in Geographic Information Systems (UNIGIS), 2009

    University of Girona

  • MSc in Management and Environmental Auditing, 2006

    University of Cadiz

  • BSc in Biology (Ecology), 2003

    University of Granada

Resume

Industry

Open PDF
Snapshot Image

Academia

Open PDF
Snapshot Image

Experience

 
 
 
 
 

Senior Data Scientist & Team Lead

Biome Makers Inc.

Jan 2022 – Present Spain / US
  • Built and led a distributed 4-person data science team, establishing code quality standards, review procedures, and aligning planning with strategic objectives.
  • Architected production systems combining high-resolution Earth observation data and machine learning, mapping 200,000+ hectares of soil microbiome and crop disease worldwide.
  • Delivered solutions that secured €2M+ in enterprise contracts, directly shaping company strategy and commercial expansion.
  • Established operational stability procedures maintaining production system reliability across global deployments.
 
 
 
 
 

Staff researcher

University of Alicante

Jan 2020 – Dec 2021 Alicante, Spain
  • Published 15 papers in peer-reviewed international journals.
  • One paper received a “Most Downloaded Paper Award”.
  • Developed the R package spatialRF for spatial modelling, with 13.000 downloads.
 
 
 
 
 

Invited Instructor

Stockholm University

Jan 2017 – Dec 2017 Stockholm, Sweden
  • Designed and instructed the post-graduate course “Practical Introduction to Species Distribution Modelling” (20 hours, 20 students).
 
 
 
 
 

Staff researcher

University of Bergen (Norway)

Sep 2016 – Aug 2019 Bergen, Norway
  • Executed a €199000 project by the Norwegian National Science Foundation (NFR).
  • Published 6 papers in peer-reviewed international journals.
  • One paper received the Most Downloaded Paper Award and was highlighted as an Editor’s Pick by the Journal
  • Developed 3 R packages with a total of 70.000 downloads for time-series comparison and analysis.
  • Enhanced my skill set in time-series analysis and R package development.
 
 
 
 
 

Staff researcher

Aarhus University

Apr 2014 – Sep 2016 Aarhus, Denmark
  • Published 8 papers in peer-reviewed international journals.
  • One paper received the “Most Downloaded Paper Award” and was highlighted as an Editor’s Pick in Science.
  • Executed a €300000 research project by the AU-Ideas Foundation (Aarhus University).
  • Expanded considerably my international collaboration network.
  • Contributed positively to the success of a world-class laboratory.
 
 
 
 
 

Invited Instructor

GBIF.es

Jan 2011 – Dec 2019 Madrid, Spain
  • Designed and instructed 9 editions of the post-graduate course “Workshop in Ecological Niche Modelling” (184 hours, 225 students).
  • My lectures were recorded and made available in the outreach plataform of the National Research Council of Spain (CSIC).
 
 
 
 
 

Staff researcher

Institute for Earth System Research

May 2010 – Apr 2014 Granada, Spain
  • Patented ‘MODELER, a model repository as knowledge based for experts’. Registration code 201299900779498, Expedient Number: GR-188-12.
  • Executed a €12000 contract with the Andalusian Government for plant diversity mapping.
  • Executed a €8000 contract with the Andalusian Government to enhance their Biodiversity Data Infrastructure.
  • Executed a €250000 grant to study the effects of Climate Change on the flora of Sierra Nevada National Park.
  • Published 11 papers in peer-reviewed international journals.
  • Designed and instructed the Master’s course ‘Ecoinformatics’ (60 hours, 25 students).
  • Instructed 3 undergraduate ecology courses (40 hours, 200 students).
  • Enhanced my skills as independent researcher.
  • Participated in an International Consortium to model and protect the plant diversity in Central America.
 
 
 
 
 

PhD student

University of Granada

May 2006 – Apr 2010 Granada, Spain
  • Obtained a 120.000€ PhD grant (Andalusian Government).
  • Completed a PhD in Computational Ecology.
  • Published 9 papers in national and peer-reviewed international journals.
  • Completed a Master’s in Geographic Information Systems.
  • Completed a Master’s en Environmental Auditing.
  • Developed technical skills in GIS and R programming.
  • Designed and instructed a full Master’s course on Geographic Information Systems (60 hours, 25 students).
  • Instructed two undergraduate courses (25 hours, 120 students) .

Skills

R programming

100%

Geographic Information Systems

100%

Data Analysis and Reporting

100%

Remote Sensing

100%

Machine Learning & Statistics

100%

Team leadership

100%

Data Engineering

100%

English

100%

Problem Solving

100%

Communication

100%

Research & Development

100%

Project Management

100%

Software

*

R package collinear

R package for multicollinearity management in data frames with numeric and categorical variables.

R package distantia

R package to compare multivariate time-series with dynamic time warping and lock-step methods.

R package spatialRF

R package for spatial regression with Random Forest

R package memoria

R package to assess ecological memory in multivariate time-series.

R package virtualPollen

R package to simulate pollen production of mono-specific tree populations over millennia.

Recent Publications

Quickly discover relevant content by filtering publications.

Species Distribution Models predict abundance and its temporal variation in a steppe bird population.

Habitat Suitability Index (HSI) derived from Species Distribution Model (SDM) has been used to infer or predict local demographic properties such as abundance for many species. Across species studied, HSI has either been presented as a poor predictor of abundance or as a predictor of potential rather than realized abundance. The main explanation of the lack of relationship between HSI and abundance is that the local abundance of a species varies in time due to various ecological processes that are not integrated into correlative SDM. To better understand the HSI-abundance relationship, in addition to the study of the association between HSI and mean abundance, we explored its variation over time. We used data from 10-years monitoring of a Houbara bustard (Chlamydotis undulata undulata) population in Morocco. From various occurrence data we modelled the HSI. From (independent) count data we calculated four local abundance indices: mean abundance, maximum abundance, the temporal trend of abundance and the coefficient of variation of abundance over the study period. We explored the relationship between HSI and abundance indices using linear, polynomial and quantile regressions. We found a triangular relationship between local abundance (mean and maximum) and HSI, indicating that the upper limit of mean and maximum abundance increased with HSI. Our results also indicate that sites with the highest HSI were associated with least variation in local abundance, the highest variation being observed at intermediate HSI. Our results provide new empirical evidence supporting the generalization of the triangular relationship between HSI and abundance. Overall, our results support the hypothesis that HSI obtained from SDMs can reflect the local abundance potentialities of a species and emphasize the importance of investigating this relationship using temporal variation in abundance.

Human practices behind the aquatic and terrestrial ecological decoupling to climate change in the tropical Andes.

Anthropogenic climate change and landscape alteration are two of the most important threats to the terrestrial and aquatic ecosystems of the tropical Americas, thus jeopardizing water and soil resources for millions of people in the Andean nations. Understanding how aquatic ecosystems will respond to anthropogenic stressors and accelerated warming requires shifting from short-term and static to long-term, dynamic characterizations of human-terrestrial-aquatic relationships. Here we use sediment records from Lake Llaviucu, a tropical mountain Andean lake long accessed by Indigenous and post-European societies, and hypothesize that under natural historical conditions (i.e., low human pressure) vegetation and aquatic ecosystems’ responses to change are coupled through indirect climate influences—that is, past climate-driven vegetation changes dictated limnological trajectories. We used a multi-proxy paleoecological approach including drivers of terrestrial vegetation change (pollen), soil erosion (Titanium), human activity (agropastoralism indicators), and aquatic responses (diatoms) to estimate assemblage-wide rates of change and model their synchronous and asynchronous (lagged) relationships using Generalized Additive Models. Assemblage-wide rate of change results showed that between ca. 3000 and 400 calibrated years before present (cal years BP) terrestrial vegetation, agropastoralism and diatoms fluctuated along their mean regimes of rate of change without consistent periods of synchronous rapid change. In contrast, positive lagged relationships (i.e., asynchrony) between climate-driven terrestrial pollen changes and diatom responses (i.e., asynchrony) were in operation until ca. 750 cal years BP. Thereafter, positive lagged relationships between agropastoralism and diatom rates of changes dictated the lake trajectory, reflecting the primary control of human practices over the aquatic ecosystem prior European occupation. We interpret that shifts in Indigenous practices (e.g., valley terracing) curtailed nutrient inputs into the lake decoupling the links between climate-driven vegetation changes and the aquatic community. Our results demonstrate how rates of change of anthropogenic and climatic influences can guide dynamic ecological baselines for managing water ecosystem services in the Andes.

Density-dependence of reproductive success in a Houbara bustard population.

Although density-dependent processes and their impacts on population dynamics are key issues in ecology and conservation biology, empirical evidence of density-dependence remains scarce for species or populations with low densities, scattered distributions, and especially for managed populations where densities may vary as a result of extrinsic factors (such as harvesting or releases). Here, we explore the presence of density-dependent processes in a reinforced population of North African Houbara bustard (Chlamydotis undulata undulata). We investigated the relationship between reproductive success and local density, and the possible variation of this relationship according to habitat suitability using three independent datasets. Based on eight years of nests monitoring (more than 7000 nests), we modeled the Daily Nest Survival Rate (DNSR) as a proxy of reproductive success. Our results indicate that DNSR was negatively impacted by local densities and that this relationship was approximately constant in space and time: (1) although DNSR strongly decreased over the breeding season, the negative relationship between DNSR and density remained constant over the breeding season; (2) this density-dependent relationship did not vary with the quality of the habitat associated with the nest location. Previous studies have shown that the demographic parameters and population dynamics of the reinforced North African Houbara bustard are strongly influenced by extrinsic environmental and management parameters. Our study further indicates the existence of density-dependent regulation in a low-density, managed population.