Ehud Reiter's Blog
Encouraging safer driving with NLG apps
I hate pay-to-publish
More on evaluating impact
Cycling in Netherlands
Patients want to know what information an AI model considers
The Aberdeen NLP Research Group
Key messages from my NLG book
Even good leaderboards may not be useful, because they are gamed
Examples of evaluating real-world impact
Benchmarks distract us from what matters
People do not understand how LLMs can/cannot help them