1 min readfrom Machine Learning

I fine-tuned an LLM to be C-3PO to test which training data format works best for persona injection [P]

I fine-tuned an LLM to be C-3PO to test which training data format works best for persona injection [P]
I fine-tuned an LLM to be C-3PO to test which training data format works best for persona injection [P]

Tested three formats: chat demos, first-person statements ("I am C-3PO..."), and synthetic Wikipedia-style docs. Same model, same LoRA config, 500 examples each.

First-person statements won on generalization, which I didn't expect. The synthetic doc model was the weirdest result: it knew C-3PO was anxious but only expressed it 37% of the time. Knowing a trait vs feeling it are apparently different things in weight space.

Code and GitHub repo link are included inside!

submitted by /u/Georgiou1226
[link] [comments]

Want to read more?

Check out the full article on the original site

View original article

Tagged with

#generative AI for data analysis
#Excel alternatives for data analysis
#real-time data collaboration
#rows.com
#natural language processing for spreadsheets
#big data management in spreadsheets
#conversational data analysis
#intelligent data visualization
#data visualization tools
#enterprise data management
#big data performance
#data analysis tools
#data cleaning solutions
#no-code spreadsheet solutions
#real-time collaboration
#C-3PO
#fine-tuned
#first-person statements
#persona injection
#generalization