Product Evals: Travel Planner, Long Context, and The Weight Of Taste

A product note on evaluating an AI travel planner: itinerary quality, OOD scenes, long-context consistency, recommendation taste, and user loops.

May 22, 2026 · 4 min · 664 words · jiaxing ni