My latest blog post showcases a minimalistic approach for training text generation architectures from @huggingface with @TensorFlow and Keras as the backend. I am utilizing @Google's mT5 model in a use case from a @kaggle competition. Read it here: medium.com/@radicho/fine-…
Radostin Cholakov
591 posts
CS @Stanford
Stanford, California
Joined March 2017
- Building an open-source command line tool for generating datasets with LLMs. It could be used either to conduct research about the LLM itself (analyze texts, train detectors) or to leverage powerful models (e.g., GPT-3.5) for tuning smaller ones. github.com/radi-cho/datas…
- Starting today I am officially part of the @GoogleDevExpert program as a Machine Learning GDE! This motivated me to begin tweeting daily about AI/ML and tech in general. Hope more people will be impacted with time.
GIF - Next month I will talk about the @TensorFlow TF-DF library and how it can be utilized to combine the strengths of neural networks with decision forest algorithms. In collaboration with @gdg_sofia @gdgbsas @GoogleDevExpert gdg.community.dev/events/details…
- As an experiment, I am building diffground.com - a simplistic UI to edit images with Stable Diffusion and InstructPix2Pix. It is built with @FlutterDev and is coming to @Android soon (together with more models and editing options).
- Today I presented our work titled "Efficient Task-Oriented Dialogue Systems with Response Selection as an Auxiliary Task" at @ICNLSP
- With the latexify_py package from @Google you can easily generate LaTeX expressions from Python code. github.com/google/latexif…
- I tried to get assistance with some research work from Gemini 1.5 Pro with a context length of 1 million tokens. Here are some takeaways:
- 📰 Today, we release ImagiNet, a high-resolution balanced dataset for synthetic image detection. It comes with a strong baseline based on self-contrastive learning, which achieves state-of-the-art results on multiple benchmarks. Some details: - Includes 200K high-resolution
- We live in a crazy world where prompt marketplaces for pre-trained models are now a thing!You can now buy and sell prompts for generative models: DALL·E, GPT-3, Mid journey, and Stable Diffusion. I'm not sure if "prompt engineering" is a long-term thing, but it's amazing to see how this is a new avenue for creators to make money: promptbase.com
- Had a great time with @mervenoyann and many other developers & ML people at yesterday's #GoogleIOConncet community dinner!
- lm-eval-harness with the exact same config gives wildly inconsistent results depending on which GPU you use (RTX 4090, A100, H100)??? 🥲 there should really be a better way to benchmark LLMs!
- Thanks to the @GoogleDevEurope team for the recognition during I/O Connect Amsterdam, even though I wasn't able to participate in person this time. Also, currently preparing a new @GoogleDevExpert talk that I hope to present during the #DevFest season.


















