Blog
Posts for category: Opinion
-
Can Foundation Models Really Replace Human Judgement? What GPT-5.2's Launch Today Tells Us, and Why RM Compare Matters More Than Ever
Today, December 11, 2025, OpenAI launched GPT-5.2. If you've been watching the space, you might be wondering: is this it? Have we reached the point where foundation models can truly replace human judgement? The answer is still no. And understanding why is critical for anyone building assessment, hiring, or decision systems that matter.
-
The £63 Billion Question: Why "Evolution" isn't enough to solve the UK's productivity crisis
The regulator has chosen a slow path for digital assessment. The economy can’t afford to wait. Here is why the future of skills depends on Transformation, not just Substitution.
-
The Skills Imperative 2035: Why the Future of Assessment Can’t Be a Tick-Box Exercise
The latest NFER report confirms that the skills of the future - Creativity, Collaboration, and Problem Solving - are the hardest to measure. Here is how we solve that.
-
Privacy by Design: Why RM Compare Doesn't Import Student Rosters
We don't import your student rosters. We don't ingest names, IDs, or demographic data linked to assessment work. We use email addresses only where operationally necessary. This isn't a constraint we're managing around. It's a deliberate design choice. Here's why.
-
Playing Nicely with Rubrics: How RM Compare Enhances Absolute Assessment in the Age of AI
In a landscape where generative AI has eroded confidence in traditional written assessment signals, education leaders face an uncomfortable truth: the rubrics they've carefully crafted may no longer be fit for purpose on their own. Yet abandoning rubrics entirely isn't the answer.
-
The Validation Layer: Why AI Needs a Human Anchor to be Safe
We are currently living through an "Assessment Arms Race." On one side, students and job candidates are using Generative AI (like ChatGPT) to produce "perfect" essays and CVs in seconds. On the other side, institutions are rushing to buy AI marking tools to grade that work just as fast. It is a closed loop of machines grading machines. And in the middle of this loop, the human element - the actual understanding of quality - is quietly disappearing.
-
The Myth of the "Perfect Photo": Why Mobile Capture can be Fairer, Faster, and Just as Reliable
We assume that a blurry photo hides the quality of the work. We assume that if a drawing is photographed under yellow classroom lights, the examiner will think the drawing itself is yellow. We assume that Image Quality = Assessment Quality. But science suggests we are worrying about the wrong thing.
-
Why You See a Cup, but AI Sees Pixels: The "Amodal" Secret to Reliable Assessment
Look at the header image above. What do you see?
-
Skills England's New Framework – Why human judgement matters more than ever
In November 2025, Skills England released the UK Standard Skills Classification (UK‑SSC): a comprehensive mapping of 3,343 occupational skills, 4,926 knowledge concepts, 21,963 occupational tasks and 13 core skills, linked across occupations, qualifications and sectors.