Reynold Xin
@rxin
Cofounder @Databricks
San Francisco, CA
Joined November 2008
Posts
  • Pinned
    Oracle has spent the last two weeks writing articles comparing Oracle (and PDB) to Lakebase, and it highlights a massive philosophical divide in how we view databases in the agentic era. They are trying to retrofit heavy, traditional architectures for AI. We believe Lakebase are
  • Databricks has 4 immigrant founders. We moved to the US for the opportunity to work with the best minds in the world. Our projects power all sectors of the US economy. We employ 1400 people & growing, most in the US. None of these would've happened if we were to start in 2020.
  • Today, more than 80% of the table metadata updates on Databricks are AI-assisted. It took 2 engineers, 1 month, and less than $1000 in compute cost to develop the custom LLM for this task.
  • The DeWitt clause is anti-competitive and we should foster competition and progress. At @databricks, we are removing the DeWitt Clause from our service terms to encourage open benchmarks and research, and calling upon the rest of the industry to follow.
  • The Unreasonable Effectiveness of Deep Learning on Spark databricks.com/blog/2016/04/0… This is super exciting - next major chapter in big data.
  • Apache Spark has won the SIGMOD Systems Award, and Photon won the Best Industry Paper Award! This is a great testament to the adoption and impact of these two systems. If you are at SIGMOD this wk, please come by and say hi.
  • Delta Lake is now part of the Linux Foundation! EBs of data/month, in production at 1000s of organizations. Can't wait to see how the community will shape its future and establish it as a standard for data lakes. databricks.com/blog/2019/10/1…
  • "We stand by our blog post and the results: Databricks SQL provides superior performance and price performance over Snowflake, even on data warehousing workloads (TPC-DS)."
  • English is the new programming language. Generative AI is the new compiler. Python is the new byte code.
  • Want to write your Spark job in C#? Microsoft just published Spark .NET CLR language binding spark-packages.org/package/skaart…
  • The best data warehouse is a lakehouse! Databricks SQL (our data warehousing product on top of the Lakehouse) became GA 2.5 yrs ago, and just crossed $400m ARR. One of the fastest growing enterprise software products ever.
    Databricks revealed some sensational growth this week, as they did last year. Exiting this quarter to $2.4 billion annual run rate, the company’s revenue growth is accelerated year-over-year by 10 percentage points. Net dollar retention is a major driver of growth at 140%, which
  • Apache Spark 2.0.2 is released. Update if you are on any 2.0.x version. spark.apache.org/downloads.html
  • Spark "hall of fame": 8000+ nodes in a single cluster, 1PB+/day ingest, mapping the brain at scale, shuffling 1PB slideshare.net/databricks/lar…
  • Databricks SQL Python connector is here! Connecting to Databricks SQL in your applications just got much easier. sprou.tt/1K9JPGe3VcH