Supervised fine-tuning on curated data is reinforcement learning

Status
Not open for further replies.