•1 min read•from Machine Learning
I fine-tuned an LLM to be C-3PO to test which training data format works best for persona injection [P]
![I fine-tuned an LLM to be C-3PO to test which training data format works best for persona injection [P]](/_next/image?url=https%3A%2F%2Fexternal-preview.redd.it%2F1C7DaY19KuG89mhvojvL6_8K06VFLwcmz6YoskyYn3o.jpeg%3Fwidth%3D640%26crop%3Dsmart%26auto%3Dwebp%26s%3D2086a42057db58a396c8024e3b169eddade458dd&w=3840&q=75)
| Tested three formats: chat demos, first-person statements ("I am C-3PO..."), and synthetic Wikipedia-style docs. Same model, same LoRA config, 500 examples each. First-person statements won on generalization, which I didn't expect. The synthetic doc model was the weirdest result: it knew C-3PO was anxious but only expressed it 37% of the time. Knowing a trait vs feeling it are apparently different things in weight space. Code and GitHub repo link are included inside! [link] [comments] |
Want to read more?
Check out the full article on the original site
Tagged with
#generative AI for data analysis
#Excel alternatives for data analysis
#real-time data collaboration
#rows.com
#natural language processing for spreadsheets
#big data management in spreadsheets
#conversational data analysis
#intelligent data visualization
#data visualization tools
#enterprise data management
#big data performance
#data analysis tools
#data cleaning solutions
#no-code spreadsheet solutions
#real-time collaboration
#C-3PO
#fine-tuned
#first-person statements
#persona injection
#generalization