Prompting a large foundation model to do a simple task like classification is often overkill. Sentiment analysis is an example of a classification task where you can get the same results by fine-tuning a much smaller, cheaper, and faster model. In this example, we will take a dataset of financial news statements and fine-tune Llama 3.2 3B to categorize them as bad, good, or great news.
mathi-unsightly-jade-crab_answers
) with the modelās responses.
The advantage of our unified API is that I was able to reuse an old evaluation script and just change the model name and the dataset it was evaluating instead of writing a new API configuration.
While it seems like the model is doing pretty well, we can see there are still a couple of mistakes.