Judge AI game makers by the first playable result
The best AI game maker for one team may be the wrong one for another. A useful evaluation starts with the first playable result: how quickly you reach it, how clearly you can judge it, and how easy it is to revise.
For short browser games, the tool should reduce the distance between idea and playtest. Fancy ideation alone is not enough if the result is hard to open or hard to adjust.
Look at workflow, not just feature lists
Compare where the prompt lives, whether the draft is playable in context, and whether follow-up edits stay close to the game instead of splitting into disconnected tools.
That is the difference between a tool that helps you test a game idea and a tool that only helps you describe one.