Fernando Fernandes Neto (@FernandoNetoAi) / X

Fernando Fernandes Neto

948 posts

Fernando Fernandes Neto

@FernandoNetoAi

Machine Learning and AI researcher wayyy before all this hype.

Joined December 2023

Fernando Fernandes Neto
@FernandoNetoAi
Dec 30, 2023
It seems @huggingface and @MistralAI are sharing some secrets, and I've just found out them over the docs. Love you guys!
34K
Fernando Fernandes Neto
@FernandoNetoAi
Apr 20, 2024
Ladies and Gentlemen! @erhartford , @latkins and me are preparing to let all of you fucking crazy ... THIS IS THE BEST DOLPHIN RELEASE FUCKING EVER!!!!! 8B MODEL BEASTTTTTT
8.3K
Fernando Fernandes Neto
@FernandoNetoAi
Jan 2, 2024
Me and @erhartford are pleased to announce [maybe] the first successful LASER model @ @huggingface. Our model showed superior benchmarks over our latest DPO version of Dolphin finetune over Mistral AI's Mistral 7b. Pt 1
19K
Fernando Fernandes Neto
@FernandoNetoAi
May 22, 2024
Hi, folks! Me, @DavidGFar , @latkins and @erhartford cannot stop inventing new crazy stuff. Now we are delighted to announce Kraken, sponsored by @HyperspaceAI and @VAGOsolutions. (1/N)
13K
Fernando Fernandes Neto
@FernandoNetoAi
Jan 4, 2024
Hi! Me and @erhartford are opensourcing our LaserRMT implementation of the original Laser Paper. We improved the search algorithm by employing random matrix theory and Marchenko-Pastur theory. Let's get loads of models being 'lasered" @huggingface
8.3K
Fernando Fernandes Neto
@FernandoNetoAi
Jan 13, 2024
LLMs and autistic people: For those who doesn't know, I am the father of an autistic child and an LLM researcher. One thing has caught my attention about a slight similarity between the behavior of autistic children and LLMs; and eventually our definition of intelligence. (1/4)
10K
Fernando Fernandes Neto
@FernandoNetoAi
Jan 15, 2024
Me, @erhartford and David Golchinfar are pleased to announce our new model. Cognitive Computations - Laserxtral 4x7B. This is basically a MoE done using the mergekit provided by Charles Goddard. This model exhibts strong reasoning capabilities and truthfulness. (1/3)
6.2K
Fernando Fernandes Neto
@FernandoNetoAi
Mar 5, 2024
This was a very smart trick we have had with @erhartford . We have created a small HF Transformer = PyTorch hack to enable an "online passthrough" frankenmerge that loops in the forward method. Hence we have the same model results, but way less vRAM use. We are excited! (1/2)
Eric Hartford
@QuixiAI
Mar 5, 2024
@DavidGFar @FernandoNetoAi congratulations David and Fernando on the release of Dolphin-phi-kensho!
5K
Fernando Fernandes Neto
@FernandoNetoAi
Feb 23, 2024
After some small pushing from @ivanfioravanti , we (me, @erhartford and @DavidGFar) are just releasing scripts for laserRMT compatible to MPS. So now modelers can scan their models and laser them. Thanks @HyperspaceAI and @VAGOsolutions for the support.
GitHub - QuixiAI/laserRMT: This is our own implementation of 'Layer Selective Rank Reduction'
From github.com
3.6K
Fernando Fernandes Neto
@FernandoNetoAi
May 22, 2024
Replying to @FernandoNetoAi
... Yes you can. You can mixup whatever you can. And we are open sourcing the whole pipeline to achieve that as well. Welcome to Kraken! [GitHub]: github.com/cognitivecompu… [Demo Model]: huggingface.co/cognitivecompu…
GitHub - QuixiAI/kraken
From github.com
995
Fernando Fernandes Neto
@FernandoNetoAi
Jun 6, 2024
Now it is OFFICIAL! BTW, it's MMLU score is VERY close to gpt4 (86.9) I don't wanna talk too much, but this is the SOTA in open source models. So glad to be working with Eric and @latkins on enabling this. Thanks @Alibaba_Qwen for the excellent base model!
Eric Hartford
@QuixiAI
Jun 6, 2024
Cognitive Computations presents Dolphin-2.9.2-Qwen2-72b. The best Dolphin ever. Thanks to @Alibaba_Qwen for the excellent base model! 83.9 mmlu and 128k context! New in 2.9.2 is SystemChat - A dataset designed to teach the model to obey the system prompt, even over a long
1.9K
Fernando Fernandes Neto
@FernandoNetoAi
Jan 2, 2024
Replying to @FernandoNetoAi @pratyusha_PS and @MIT
Available @
dphn/dolphin-2.6-mistral-7b-dpo-laser · Hugging Face
From huggingface.co
2.4K
Fernando Fernandes Neto
@FernandoNetoAi
Feb 12, 2024
Me, @DavidGFar and @erhartford are proud to share our new notebook (Laser Qlora). How can we spot layers that are more prone to absorb new knowledge and continue further fine-tuning a pre-existing sft model??? Thanks @HyperspaceAI and @VAGOsolutions for supporting. (Link below)
4K
Fernando Fernandes Neto
@FernandoNetoAi
Jan 29, 2024
And the best 7b Model @ HF leaderboard is a LaserRMT one <3 ... Feeling proud with @erhartford and @DavidGFar ... Congratulations for Tim Dollan!
1.9K