Ehud Reiter's Blog

follow: @[email protected]

Posts

Encouraging safer driving with NLG apps

I hate pay-to-publish

More on evaluating impact

Cycling in Netherlands

Patients want to know what information an AI model considers

The Aberdeen NLP Research Group

Key messages from my NLG book

Even good leaderboards may not be useful, because they are gamed

Examples of evaluating real-world impact

Benchmarks distract us from what matters

People do not understand how LLMs can/cannot help them