Framework

Our primary goal is to produce an evaluation framework and techncial test harness for the assessment of historical research tools powered by Large Language Models (LLM), prioritising scholarly and cultural values. A significant amount of this work will involve horizon scanning of existing best practices currently being developed across the research, GLAM, and commercial sectors. Versions of the framework will be listed here as they appear.