Oxen.ai helps you compare results from your machine learning models.
validation_response
).
We can see here that our models didn’t output exactly “True” or “False” like they were told to. So we added a column processed_response
to show a clean difference between the outputs.
processed_response
’s are different in each file.
llama_chat
in this case) didn’t really provide an answer, as it responded with both “True and False”.