This is only indirectly related, but we found that models that do well at structured output value generation tend to do well at row updates too. We ran a benchmark on this: [https://interfaze.ai/leaderboards/structured-output-benchmark](https://interfaze.ai/leaderboards/structured-output-benchmark) Open-source models like GLM 4.7 and Qwen do well. Closed-source models like GPT 5.4 do really well, but they're not something you would realistically use in prod since they're slow and expensive. The next things we look at are cost and modality support across image, text, and audio, since this is a high volume of updates/inserts with different modalities. We built our model, [interfaze.ai](http://interfaze.ai), for tasks like this. It supports all of these modalities, is low cost, and is trained for tasks like database updates due to its deterministic nature.
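
To make the link between structured output and row updates concrete, here is a rough sketch of how a model's JSON output could be checked before it touches a row. Everything in it is an assumption for illustration: the `products` table, the `SCHEMA` column spec, and the helper names `parse_row_update` and `apply_update` are hypothetical and not tied to any particular model or API.

```python
import json
import sqlite3

# Hypothetical column spec for an illustrative "products" table.
SCHEMA = {"name": str, "price": float, "in_stock": bool}


def parse_row_update(raw: str) -> dict:
    """Parse model output as JSON and check each value against SCHEMA."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    unknown = set(data) - set(SCHEMA)
    if unknown:
        raise ValueError(f"unknown columns: {sorted(unknown)}")
    checked = {}
    for col, value in data.items():
        expected = SCHEMA[col]
        # JSON has no separate float type, so accept plain ints for floats.
        if expected is float and type(value) is int:
            value = float(value)
        # Exact type match, so a bool is never accepted as a number.
        if type(value) is not expected:
            raise ValueError(f"{col}: expected {expected.__name__}")
        checked[col] = value
    return checked


def apply_update(conn: sqlite3.Connection, row_id: int, updates: dict) -> int:
    """Apply validated updates to one row and return the affected row count."""
    if not updates:
        return 0
    # Column names come from SCHEMA, so only values need to be parameterized.
    assignments = ", ".join(f"{col} = ?" for col in updates)
    cur = conn.execute(
        f"UPDATE products SET {assignments} WHERE id = ?",
        [*updates.values(), row_id],
    )
    conn.commit()
    return cur.rowcount
```

The idea is that a model with reliable structured output rarely trips the `ValueError` paths, which is what makes high-volume updates practical. A model with weaker structured output would need retries at this step, and that adds cost and latency.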