Xiaotian (Max) Han (@XiaotianHan1) / X

Xiaotian (Max) Han

185 posts

Xiaotian (Max) Han

@XiaotianHan1

Training LLM

Palo Alto

Joined October 2018

Xiaotian (Max) Han
@XiaotianHan1
Jan 7, 2024
With the recent release of #TinyLlama, SLMs have attracted a lot of attention. I re-released my previously trained SLM - LiteLlama under the MIT license, which has 460M parameters trained with 1T tokens. I hope to contribute a bit to the community.
ahxt/LiteLlama-460M-1T · Hugging Face
From huggingface.co
8.8K
Xiaotian (Max) Han
@XiaotianHan1
Nov 26, 2023
🧑‍💻Spent <2 hrs learning Typst,@typstapp, a LaTeX alternative, and moved my CV (thx @mengliu_1998 template) to it. Compared to LaTeX's steeper learning curve, I'm really impressed with this lightweight, yet powerful, new typesetting system and will use it for non-paper things.💯✍️
6K
Xiaotian (Max) Han
@XiaotianHan1
Nov 28, 2024
It will be a special experience to present our paper as an @TmlrOrg Track Oral Presentation this Thanksgiving night at @LogConference. Grateful to the organizers for scheduling this at such a special time. Join us if you're interested and available, and Happy Thanksgiving!
Xiaotian (Max) Han
@XiaotianHan1
Sep 24, 2024
🤔Sharing some old thoughts that graph convolution is equivalent to Mixup tx.ag/gcnmixup It was previously believed that the effectiveness of graph convolution stems from neighbor aggregation, which enhances or enriches the representation of the target node. However,
4K
Xiaotian (Max) Han
@XiaotianHan1
Sep 6, 2023
🔥Thrilled to train a lite LLaMa_1 and a lite LLaMa_2, now available on Huggingface! While they might not be the large models, they were trained across multiple nodes. First step into the world of LLM 🤗. View the training loss curve
links-cdn.wandb.ai
reduced_train_loss (23/09/05 20:48:01)
6.2K
Xiaotian (Max) Han
@XiaotianHan1
Oct 25, 2023
Super excited and deeply grateful to receive the #NeurIPS2023 Scholar Award to support my attending for the first time ever. Thanks to the great organizers, who are not average at all. See you all in New Orleans!
Rosanne Liu
@savvyRL
Oct 22, 2023
Good day! Average researcher here going through thousands of NeurIPS financial aid applications trying to roll out all decisions by Monday. (Some of you should have already received decisions; others: thanks for your patience!)
9.7K
Xiaotian (Max) Han
@XiaotianHan1
Jun 21, 2023
📢 Looking for easy-to-use fairness baselines? Curious about utility-fairness trade-off control? Unsure about training endpoints? Check out our new benchmark paper for answers!👇 Code: github.com/ahxt/fair_fair… Paper: arxiv.org/abs/2306.09468 #AI #MachineLearning #Fairness
GitHub - ahxt/fair_fairness_benchmark: FFB: A Fair Fairness Benchmark for In-Processing Group...
From github.com
9.2K
Xiaotian (Max) Han
@XiaotianHan1
May 25, 2024
Our SelfExtend (#ICML2024) was highlighted in a Google I/O session at youtu.be/TV7qCk1dBWA?t=… to demonstrate the long-context ability of Gemma. SelfExtend is already a go-to method to extend the context window not only for Gemma, but also for Llama, Mistral, and more. SelfExtend
François Chollet
@fchollet
May 19, 2024
If you missed it, this I/O session on LLMs with Keras 3 is a great tutorial on LLM training and fine-tuning best practices youtu.be/TV7qCk1dBWA
Large language models with Keras
From youtube.com
3.4K
Xiaotian (Max) Han
@XiaotianHan1
Jan 18, 2024
Thrilled to share that this paper has been accepted by #ICLR2024! It offers a range of user-friendly fairness methods, metrics, and datasets. Please try them out! We hope this project can facilitate fairness research and welcome contributions of new fairness algorithms!
Xiaotian (Max) Han
@XiaotianHan1
Jun 21, 2023
📢 Looking for easy-to-use fairness baselines? Curious about utility-fairness trade-off control? Unsure about training endpoints? Check out our new benchmark paper for answers!👇 Code: github.com/ahxt/fair_fair… Paper: arxiv.org/abs/2306.09468 #AI #MachineLearning #Fairness
2.8K
Xiaotian (Max) Han
@XiaotianHan1
Jan 25, 2025
🚀 New Research: Thinking Preference Optimization! We boost LLM reasoning by using long CoT as preferred examples & short CoT as rejected in DPO training. ✨ Key insight: Careful curation of long/short CoT pairs enhances reasoning ability. tx.ag/ttpo
1.3K
Xiaotian (Max) Han
@XiaotianHan1
Dec 4, 2023
Excited to attend #NeurIPS2023 from Dec 9-15! Can't wait to reconnect and meet new minds. 🎓I am on the academic job market for 2023-2024 and am keen on discussing opportunities! ahxt.github.io #academicjobs #openrank #tenuretrack
2.9K
Xiaotian (Max) Han
@XiaotianHan1
Feb 21, 2025
We introduce Thinking Preference Optimization (ThinkPO)—a simple yet effective post-SFT method that enhances long CoT reasoning without requiring new long CoT responses. Instead, ThinkPO leverages existing short CoT responses as rejected answers and long CoT responses as chosen
2.7K
Xiaotian (Max) Han
@XiaotianHan1
Apr 6, 2024
SelfExtend, without further training, upgrades Mistral-inst-v0.1 to match the performance level of its successor, v0.2, in qa tasks. therefore, the value of SelfExtend is at least equivalent to the training cost of Mistral-inst-v0.2?
1.8K
Xiaotian (Max) Han
@XiaotianHan1
Jan 20, 2024
Replying to @cwolferesearch
Thanks for sharing!!! Thanks!! Please see our repo for the simple implementation github.com/datamllab/Long…
960
Xiaotian (Max) Han
@XiaotianHan1
Feb 28, 2024
Curious if LLM architecture improves over time? 🤔 We conducted a preliminary experiment comparing training loss curves for different architectures. To ensure a fair (relatively) comparison, we use 1) the same (almost) size parameter 2) the same training data 3) the same training
1.8K