I’ve been considering fine-tuning Llama 3.1 70B for my specific use case, which involves highly specialized Salesforce logic. This area has a much smaller online presence than widely used open-source databases like PostgreSQL.
We’ve already built a working pipeline that uses RLHF (Reinforcement Learning from Human Feedback) as part of our fine-tuning process.
However, I have one main concern:
What if Sam releases a superior model in November? Would that make the time, effort, and money I’ve invested in fine-tuning obsolete?