<?xml version="1.0" encoding="utf-8" standalone="yes" ?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Posts on DEEM Lab</title>
    <link>https://deem.berlin/post/</link>
    <description>Recent content in Posts on DEEM Lab</description>
    <generator>Hugo -- gohugo.io</generator>
    <language>en-us</language>
    <copyright>&amp;copy; 2024</copyright>
    <lastBuildDate>Sun, 15 Mar 2026 00:00:00 +0100</lastBuildDate>
    
	<atom:link href="https://deem.berlin/post/index.xml" rel="self" type="application/rss+xml" />
    
    
    <item>
      <title>SemBench accepted at VLDB</title>
      <link>https://deem.berlin/post/2026-03-15-sembench/</link>
      <pubDate>Sun, 15 Mar 2026 00:00:00 +0100</pubDate>
      
      <guid>https://deem.berlin/post/2026-03-15-sembench/</guid>
      <description>The benchmark paper &lt;a href=&#34;https://deem.berlin/publication/2026-03-15-sembench-a-benchmark-for-semantic-query-processing-engines/&#34; target=&#34;_blank&#34;&gt;SemBench: A Benchmark for Semantic Query Processing Engines&lt;/a&gt; has been accepted for publication at VLDB&amp;rsquo;26. This is the result of a  collaboration between Google&amp;rsquo;s BigQuery team and researchers from Cornell, MIT, UTN, UMichigan, and our lab, driven by Olga from our side. Checkout &lt;a href=&#34;https://sembench.org/&#34; target=&#34;_blank&#34;&gt;sembench.org&lt;/a&gt; for the latest leaderboard and results!</description>
    </item>
    
    <item>
      <title>SIGMOD Research Highlight Award for Arnab</title>
      <link>https://deem.berlin/post/2025-12-25-arnab-award/</link>
      <pubDate>Tue, 16 Dec 2025 00:00:00 +0100</pubDate>
      
      <guid>https://deem.berlin/post/2025-12-25-arnab-award/</guid>
      <description>Arnab’s 2025 paper, &lt;a href=&#34;https://openproceedings.org/2025/conf/edbt/paper-82.pdf&#34; target=&#34;_blank&#34;&gt;MEMPHIS: Holistic Lineage-based Reuse and Memory Management for Multi-backend ML Systems&lt;/a&gt;, has received the prestigious &lt;strong&gt;SIGMOD Research Highlight Award&lt;/strong&gt; and will appear in the March 2026 special issue of SIGMOD Record. The paper has previously won the EDBT 2025 Best Research Paper Award.</description>
    </item>
    
    <item>
      <title>DEEM at RecSys 2025</title>
      <link>https://deem.berlin/post/2025-11-03-barrie-tors/</link>
      <pubDate>Mon, 03 Nov 2025 00:00:00 +0100</pubDate>
      
      <guid>https://deem.berlin/post/2025-11-03-barrie-tors/</guid>
      <description>Barrie has been invited to submit an extended version of his paper on &lt;a href=&#34;https://deem.berlin/publication/2025-07-03-scalable-data-debugging-for-neighborhood-based-recommendation-with-data-shapley-values-recsys/&#34; target=&#34;_blank&#34;&gt;Scalable Data Debugging for Neighborhood-based Recommendation with Data Shapley Values&lt;/a&gt; to the &lt;strong&gt;Special Issue on Highlights of RecSys ’25&lt;/strong&gt; of the &lt;a href=&#34;https://dl.acm.org/journal/tors&#34; target=&#34;_blank&#34;&gt;ACM Transactions on Recommender Systems&lt;/a&gt;.</description>
    </item>
    
    <item>
      <title>DEEM at RecSys 2025</title>
      <link>https://deem.berlin/post/2025-09-26-deem-at-recsys/</link>
      <pubDate>Fri, 26 Sep 2025 00:00:00 +0200</pubDate>
      
      <guid>https://deem.berlin/post/2025-09-26-deem-at-recsys/</guid>
      <description>Barrie and Pierre are at &lt;a href=&#34;https://recsys.acm.org/recsys25/&#34; target=&#34;_blank&#34;&gt;RecSys&lt;/a&gt; in Prague this week! Barrie presents our work on on &lt;a href=&#34;https://deem.berlin/publication/2025-07-03-scalable-data-debugging-for-neighborhood-based-recommendation-with-data-shapley-values-recsys/&#34; target=&#34;_blank&#34;&gt;Scalable Data Debugging for Neighborhood-based Recommendation with Data Shapley Values&lt;/a&gt;, which was selected as a spotlight oral. Pierre gives a talk on his initial ideas &lt;a href=&#34;https://deem.berlin/publication/2025-08-14-towards-a-real-world-aligned-benchmark-for-unlearning-in-recommender-systems-facctrec-recsys/&#34; target=&#34;_blank&#34;&gt;Towards a Real-World Aligned Benchmark for Unlearning in Recommender Systems&lt;/a&gt; at the Responsible recommendation workshop.</description>
    </item>
    
    <item>
      <title>DEEM at ICML 2025</title>
      <link>https://deem.berlin/post/2025-07-15-deem-at-icml/</link>
      <pubDate>Tue, 15 Jul 2025 00:00:00 +0200</pubDate>
      
      <guid>https://deem.berlin/post/2025-07-15-deem-at-icml/</guid>
      <description>Meet Olga from our lab at &lt;a href=&#34;https://icml.cc/virtual/2025&#34; target=&#34;_blank&#34;&gt;ICML&lt;/a&gt; in Vancouver this week! She will present a research paper on &lt;a href=&#34;https://deem.berlin/publication/2025-03-22-scssl-bench-benchmarking-self-supervised-learning-for-single-cell-data/&#34; target=&#34;_blank&#34;&gt;scSSL-Bench: Benchmarking Self-Supervised Learning for Single-Cell Data&lt;/a&gt;, which was selected as a spotlight poster, as well as a second paper on &lt;a href=&#34;https://deem.berlin/publication/2025-06-09-towards-cross-modal-error-detection-with-tables-and-images-dataworld/&#34; target=&#34;_blank&#34;&gt;Towards Cross-Modal Error Detection with Tables and Images&lt;/a&gt; at the DataWorld workshop.</description>
    </item>
    
    <item>
      <title>Zeyu interns with Amazon Q</title>
      <link>https://deem.berlin/post/2025-07-14-zeyu-amazon-q/</link>
      <pubDate>Mon, 14 Jul 2025 00:00:00 +0200</pubDate>
      
      <guid>https://deem.berlin/post/2025-07-14-zeyu-amazon-q/</guid>
      <description>Our external PhD student Zeyu from the University of Amsterdam is starting his internship with the &lt;a href=&#34;https://aws.amazon.com/q/&#34; target=&#34;_blank&#34;&gt;Amazon Q&lt;/a&gt; team of AWS Berlin this week!</description>
    </item>
    
    <item>
      <title>Barrie Kersbergen defended his PhD</title>
      <link>https://deem.berlin/post/2025-07-07-barrie-defended/</link>
      <pubDate>Mon, 07 Jul 2025 00:00:00 +0200</pubDate>
      
      <guid>https://deem.berlin/post/2025-07-07-barrie-defended/</guid>
      <description>Our external PhD Student &lt;a href=&#34;https://bkersbergen.github.io&#34; target=&#34;_blank&#34;&gt;Barrie Kersbergen&lt;/a&gt; (co-supervised with &lt;a href=&#34;https://staff.fnwi.uva.nl/m.derijke/&#34; target=&#34;_blank&#34;&gt;Maarten de Rijke&lt;/a&gt;) has successfully defended his PhD at the University of Amsterdam! Barrie&amp;rsquo;s research on recommender systems has been deployed to millions of users at the European e-commerce platform &lt;a href=&#34;https://www.bol.com&#34; target=&#34;_blank&#34;&gt;bol.com&lt;/a&gt;.</description>
    </item>
    
    <item>
      <title>DEEM at SIGMOD 2025</title>
      <link>https://deem.berlin/post/2025-06-18-deem-at-sigmod/</link>
      <pubDate>Tue, 10 Jun 2025 00:00:00 +0200</pubDate>
      
      <guid>https://deem.berlin/post/2025-06-18-deem-at-sigmod/</guid>
      <description>Meet our lab at the &lt;a href=&#34;https://2025.sigmod.org/index.shtml&#34; target=&#34;_blank&#34;&gt;SIGMOD conference&lt;/a&gt; in Berlin next week! We are part of the &lt;a href=&#39;https://2025.sigmod.org/org_conference_officers.shtml&#39; target+&#39;_blank&#39;&gt;organizing committee&lt;/a&gt; of the conference and co-organise the &lt;a href=&#39;https://deem-workshop.github.io&#39; target+&#39;_blank&#39;&gt;DEEM workshop&lt;/a&gt; as well. Furthermore, we will present a workshop paper on &lt;a href=&#39;http://localhost:1313/pdf/tadv-deem.pdf&#39; target+&#39;_blank&#39;&gt;Towards Automated Task-Aware Data Validation&lt;/a&gt; and run a tutorial on &lt;a href=&#39;https://navigating-data-errors.github.io&#39; target+&#39;_blank&#39;&gt;Navigating Data Errors in Machine Learning Pipelines&lt;/a&gt; on Friday.</description>
    </item>
    
    <item>
      <title>Participation in the TRL seminar in Dagstuhl</title>
      <link>https://deem.berlin/post/2025-05-02-trl-dagstuhl/</link>
      <pubDate>Fri, 02 May 2025 00:00:00 +0200</pubDate>
      
      <guid>https://deem.berlin/post/2025-05-02-trl-dagstuhl/</guid>
      <description>Olga and Sebastian took part in the seminar on the &lt;a href=&#34;https://www.dagstuhl.de/en/seminars/seminar-calendar/seminar-details/25182&#34; target=&#34;_blank&#34;&gt;Challenges and Opportunities of Table Representation Learning&lt;/a&gt; in Dagstuhl, which aims to connect the communities of data management, machine learning, and natural language processing to discuss the future of learning on tabular data.</description>
    </item>
    
    <item>
      <title>Two talks at EDBT</title>
      <link>https://deem.berlin/post/2025-03-25-zeyu-edbt/</link>
      <pubDate>Tue, 25 Mar 2025 00:00:00 +0100</pubDate>
      
      <guid>https://deem.berlin/post/2025-03-25-zeyu-edbt/</guid>
      <description>Zeyu gave an invited talk about the efficient utilization of language models for table data preparation at the industry event on &lt;a href=&#34;https://edbticdt2025.upc.edu/?contents=next_generation_dms.html&#34; target=&#34;_blank&#34;&gt;Next-Generation Data Management Systems&lt;/a&gt; at &lt;a href=&#34;https://edbticdt2025.upc.edu/&#34; target=&#34;_blank&#34;&gt;EDBT 2025&lt;/a&gt; in Barcelona, and subsequently presented our paper on &lt;a href=&#34;publication/2025-02-05-a-deep-dive-into-cross-dataset-em-with-small-and-large-language-models-edbt/&#34; target=&#34;_blank&#34;&gt;A Deep Dive Into Cross-Dataset Entity Matching with Large and Small Language Models&lt;/a&gt;.</description>
    </item>
    
    <item>
      <title>The EU AI Act -- Developing a technical perspective</title>
      <link>https://deem.berlin/post/2025-01-11-deem-sigmod/</link>
      <pubDate>Sat, 11 Jan 2025 00:00:00 +0100</pubDate>
      
      <guid>https://deem.berlin/post/2025-01-11-deem-sigmod/</guid>
      <description>Stefan will be co-organising the workshop on &lt;a href=&#34;https://deem-workshop.github.io/&#34; target=&#34;_blank&#34;&gt;Data Management for End-to-End Machine Learning (DEEM)&lt;/a&gt; at &lt;a href=&#34;https://2025.sigmod.org&#34; target=&#34;_blank&#34;&gt;SIGMOD 2025&lt;/a&gt; in Berlin.</description>
    </item>
    
    <item>
      <title>The EU AI Act -- Developing a technical perspective</title>
      <link>https://deem.berlin/post/2024-12-03-eu-ai-act-workshop/</link>
      <pubDate>Tue, 03 Dec 2024 00:00:00 +0100</pubDate>
      
      <guid>https://deem.berlin/post/2024-12-03-eu-ai-act-workshop/</guid>
      <description>We have been co-organising a workshop on &lt;a href=&#34;https://www.linkedin.com/posts/bifoldberlin_on-friday-november-29-2024-bifold-researchersprof-activity-7269591229674237952-0VbM&#34; target=&#34;_blank&#34;&gt;&amp;lsquo;The EU AI Act &amp;ndash; Developing a technical perspective&amp;rsquo;&lt;/a&gt; together with our colleagues from machine learning and law as well as industry practitioners.</description>
    </item>
    
    <item>
      <title>Interview with AI Berlin</title>
      <link>https://deem.berlin/post/2024-10-10-ai-berlin/</link>
      <pubDate>Thu, 10 Oct 2024 00:00:00 +0200</pubDate>
      
      <guid>https://deem.berlin/post/2024-10-10-ai-berlin/</guid>
      <description>Our research group has been covered in an &lt;a href=&#34;https://ai-berlin.com/blog/article/prof-dr-sebastian-schelter-research-group-lead-bifold-chair-of-deem-lab&#34; target=&#34;_blank&#34;&gt;interview on the #ai_berlin website&lt;/a&gt;.</description>
    </item>
    
    <item>
      <title>Meet us at VLDB in Chian</title>
      <link>https://deem.berlin/post/2024-08-01-vldb/</link>
      <pubDate>Sat, 01 Jun 2024 00:00:00 +0200</pubDate>
      
      <guid>https://deem.berlin/post/2024-08-01-vldb/</guid>
      <description>We will be present at the upcoming &lt;a href=&#34;https://vldb.org/2024/&#34; target=&#34;_blank&#34;&gt;VLDB&lt;/a&gt; conference in China with several contributions: &lt;ul style=&#39;margin-top: 10px;margin-left: 0;padding-left:20px;&#39;&gt;&lt;li&gt;&lt;a href=&#34;https://hpi.de/naumann/projects/conferences-and-workshops-hosted/qdb-2024.html&#34; target=&#34;_blank&#34;&gt;How Data Management Research Helps to Improve Real World ML Applications&lt;/a&gt; (Keynote at the workshop on Quality in Databases)&lt;/li&gt;&lt;li&gt;&lt;a href=&#34;publication/a-flexible-forecasting-stack-vldb/&#34; target=&#34;_blank&#34;&gt;A Flexible Forecasting Stack&lt;/a&gt; (Industry paper with AWS)&lt;/li&gt;&lt;li&gt;&lt;a href=&#34;https://deem.berlin/publication/2024-05-28-snapcase-regain-control-over-your-predictions-with-low-latency-machine-unlearning-vldb-demo/&#34;&gt;Snapcase - Regain Control over Your Predictions with Low-Latency Machine Unlearning&lt;/a&gt; (Demonstration)&lt;/li&gt;&lt;li&gt;&lt;a href=&#34;https://deem.berlin/publication/instrumentation-and-analysis-of-native-ml-pipelines-via-logical-query-plans-vldb-phd/&#34;&gt;Instrumentation and Analysis of Native ML Pipelines via Logical Query Plans&lt;/a&gt; (Paper at the PhD workshop)&lt;/li&gt;&lt;li&gt;&lt;a href=&#34;publication/assisted-design-of-data-science-pipelines-vldbj/&#34; target=&#34;_blank&#34;&gt;Assisted design of data science pipelines&lt;/a&gt; (Poster of a VLDBJ publication)&lt;/li&gt;&lt;/ul&gt;</description>
    </item>
    
    <item>
      <title>Two papers at SIGMOD in Chile</title>
      <link>https://deem.berlin/post/2024-06-01-sigmod-chile/</link>
      <pubDate>Sat, 01 Jun 2024 00:00:00 +0200</pubDate>
      
      <guid>https://deem.berlin/post/2024-06-01-sigmod-chile/</guid>
      <description>At the upcoming &lt;a href=&#34;https://2024.sigmod.org/&#34; target=&#34;_blank&#34;&gt;SIGMOD conference&lt;/a&gt; in Chile, Till will present our paper on &lt;a href=&#34;https://dl.acm.org/doi/10.1145/3654975&#34; target=&#34;_blank&#34;&gt;SchemaPile: A Large Collection of Relational Database Schemas&lt;/a&gt;. SchemaPile is a corpus of more than 200 thousand database schemas, which we envision to be a great resource for ML models dealing with structured data, e.g., in data integration tasks. Furthermore, Stefan will present his initial ideas on &lt;a href=&#34;publication/2024-04-28-towards-interactively-improving-ml-data-preparation-code-via-shadow-pipelines-deem/&#34; target=&#34;_blank&#34;&gt;Interactively Improving ML Data Preparation Code via &amp;lsquo;Shadow Pipelines&amp;rsquo;&lt;/a&gt; at the &lt;a href=&#34;https://deem-workshop.github.io&#34; target=&#34;_blank&#34;&gt;DEEM workshop&lt;/a&gt;.</description>
    </item>
    
    <item>
      <title>Two papers at ICDE</title>
      <link>https://deem.berlin/post/2024-05-12-icde-utrecht/</link>
      <pubDate>Sun, 12 May 2024 00:00:00 +0200</pubDate>
      
      <guid>https://deem.berlin/post/2024-05-12-icde-utrecht/</guid>
      <description>Barrie and Zeyu will present two papers at the &lt;a href=&#34;https://icde2024.github.io/&#34; target=&#34;_blank&#34;&gt;International Conference on Data Engineering (ICDE)&lt;/a&gt; in Utrecht. Barrie will discuss how to choose &lt;a href=&#34;publication/etude-evaluating-the-inference-latency-of-session-based-recommendation-models-at-scale-icde/&#34; target=&#34;_blank&#34;&gt;cost-efficient deployment options for neural recommendation models&lt;/a&gt; in e-commerce, while Zeyu will present initial ideas for &lt;a href=&#34;publication/directions-towards-efficient-data-wrangling-with-llms-dbml-icde/&#34; target=&#34;_blank&#34;&gt;zero-shot entity matching&lt;/a&gt;.</description>
    </item>
    
    <item>
      <title>Poster at ICLR</title>
      <link>https://deem.berlin/post/2024-05-12-datascope-iclr/</link>
      <pubDate>Mon, 06 May 2024 00:00:00 +0200</pubDate>
      
      <guid>https://deem.berlin/post/2024-05-12-datascope-iclr/</guid>
      <description>The &amp;lsquo;DataScope&amp;rsquo; paper on &lt;a href=&#34;publication/canonpipe-data-debugging-with-shapley-importance-over-machine-learning-pipelines/&#34; target=&#34;_blank&#34;&gt;debugging ML pipelines via Shapley importance&lt;/a&gt; will be presented at the &lt;a href=&#34;https://iclr.cc/&#34; target=&#34;_blank&#34;&gt;International Conference on Learning Representations (ICLR)&lt;/a&gt; in Vienna. This work was driven by &lt;a href=&#34;https://bojan.ninja&#34; target=&#34;_blank&#34;&gt;Bojan Karlaš&lt;/a&gt; from Harvard and &lt;a href=&#34;https://zhangce.github.io&#34; target=&#34;_blank&#34;&gt;Ce Zhang&lt;/a&gt; from the University of Chicago.</description>
    </item>
    
    <item>
      <title>Action Editor at DMLR</title>
      <link>https://deem.berlin/post/2023-10-18-dmlr/</link>
      <pubDate>Wed, 18 Oct 2023 00:00:00 +0200</pubDate>
      
      <guid>https://deem.berlin/post/2023-10-18-dmlr/</guid>
      <description>Prof. Schelter joins the newly formed &lt;a href=&#34;https://data.mlr.press/&#34; target=&#34;_blank&#34;&gt;Journal of Data-centric Machine Learning Research&lt;/a&gt; (DMLR) as an Action Editor.</description>
    </item>
    
  </channel>
</rss>