2025年2月2日 星期日

River crossing puzzle 3

 





(fails to solve. Helps illustrate.)

Why LLM performs poorly with reasoning and planning








artifact (js)






Monologue is best.


reasoning mode in o1



Talking to itself



Running into difficulties? Cute



hours? Ha Ha Ha












沒有留言:

張貼留言