How to build a human in the loop evaluation workflow.
RemoteRepo
Python class. For your use case, a small dataset is better than none, and you can always scale up. Even if it is only a few examples to start, this allows you to setup and kick off your data flywheel.
gpt-4.1-nano
to see if OpenAIβs fast and cheap model can perform the operations we need.
To start, make a cell at the top of the notebook that allows the user to put in their own OpenAI API_KEY.
with mo.status.progress_bar(total=len(df)) as bar:
is a Marimo feature that allows you to display a progress bar in the notebook to help you visualize the progress of the loop. This is helpful when you have more than 10 examples and want to know how much longer the loop will take.
remote_df
that we can use in our labeling tool.
remote_df.commit()
method.