Hacker News

hedayet 3 days ago [ - ]

1. Can someone help me articulate what Tinker can do that Vertex AI or many others can't? (I can see access to some primitives, which is nice)

2. and more broadly: Has anyone got real lift in business metrics through fine-tuning an open model over using the flagship models from say OpenAI or Anthropic?

BoorishBears 3 days ago [ - ]

Most managed finetuning offerings take your dataset, some hyperparameters, and spit out a model. Few support RL, and those that do have very limited support.

And I have gotten a real lift, in cost effectiveness and engagement (for creative writing)

QuadmasterXLII 3 days ago [ - ]

What are you doing that requires RL-ing creative writing for engagement?

BoorishBears 2 days ago [ - ]

I don't apply RL directly to engagement (and don't think it's really possible without some insane scale of feedback)

Instead there are mechanical mistakes models make that harm engagement and are trivially verifiable (overused phrases and concepts, hitting a given target reading level, etc.)

Improving those is what improves engagement.