Opinion

Ranks, rulers and grades - a worked example

By Mark House

5th jun 2023

Hopefully you have had a chance to take a look at some of the new content in the Help Centre relating to the arrival of RM Compare Manual On-Demand.

In the following example we have taken a rank order, turned it into a ruler and applied some grade boundaries as a way to highlight some important concepts and understandings.

The Rank

We used the James Rank Order Session data for the purposes of this task. This is available as part of the Free Plan.

Candidates: 131 year 6 students from a federation of 13 schools all located in the same local authority. All schools had a non-selective intake.
Item Stimulus: "The local council have written to you as part of a consultation process. They are proposing to sell part of your school field to raise some money. All money raised would go toward education in the city - some of it will be allocated to your school to be spent on facility improvement. You have been asked to respond to the consultation by writing a formal letter to the council." (Note! additional information was provided to students to help them to consider the issues at hand. They were also taught the key writing skills needed for completing a task like this).
Items: Handwritten (Available here in the Demo Item Bank).
Judges: 36 teachers. All schools were represented in the judging pool.
Holistic Statement: "Which is the most effective piece of writing?" (Additional descriptors were provided to Judges).
Results: below (visit the Free Plan to see the full reporting data)

The Ruler

We have written before about the principle of transforming 'true scores' to 'scaled scores' as part of an approach toward grading.

Once we have items in a scaled score we can apply ruler boundaries, in this case at every 5% points. Within each ruler section we were then able to select a standardised item from the centre of the range (if your rank creation session is smaller you may just choose all the items within).

We now have a ruler with 20 standardised items on it. In the example below this presents itself as 5 items in each section of the ruler made available to Judges.

Grading

There are a number of approaches that could be considered for creating grade boundaries (shown in the example below)

Ranked: E.g. After Item 20 on the rank order
Scaled: E.g. After the top 10% of the scaled scores
Seeded: E.g. After the seeded item 17450.

A candidate experience - Bazza Timpson

Let's take a look at candidate Bazza Timpson. We can see (above) that his position on the original rank ('True Score') was 19. Depending on the grade boundary adopted here he may be awarded an A or a B grade.

On the scaled score we can see that he falls into the Ruler Item 3 Band - these bands are aligned to Scaled Grades. Items in Band 3 are graded B if we use the Scaled Grades.

Matching and Mis-Matching

In the following example 8 items were taken from the original rank. An attempt was then made by a user unfamiliar with the rank to complete a matching exercise using the ruler.

We can take these results and compare them with what we expected based on the original rank.

Name	Expect Match	Scaled Grade	Actual Match	Match Grade
L Timpson	1	A	1	A
B Timpson	3	B	4	B
T Flanders	6	B	6	B
R Flanders	8	C	8	C
N Munch	13	D	15	D
J Lovejoy	16	D	15	D
D Griffin	16	D	16	D
M Houten	20	E	19	E

We are able to observe a few important things here including

The Judge in this case made an exact match 50% of the time.
The error on mis-matches was small deviating by 1 ruler placement in 3 cases, and by 2 in another
The grade awarded was the same as expected

The crucial benefit of this process of course is the help it provides to assessors to become familiar with the standard, which takes us back to a core purpose of our work:

"How much simpler it would all be if teachers had – as a matter of normal practice – access to, and familiarity with, work from a national sample of schools, not just their own classroom?
Emeritus Professor Richard Kimbell

An Assessor familiar with the standard will have the confidence and competence to make judgements on new items as the come across them in their day-to-day work. Where they are uncertain they can use the ruler to assess any new items by repeating the matching exercise.

Always learning

A reminder that you can see our Roadmap here. You will see that this currently steers us toward building and consolidating on the ruler work completed so far while also looking toward a version that utilizes more fully Adaptive Comparative Judgement.

There is still a lot to learn and we very much welcome your feedback.

Over to you!

We can't wait to see what our users do with this new functionality. You will need an Advanced License to create a distribute rulers. Get in touch to find out more.

Group	Name	Domain	Expiration	Security	Purpose
necessary	csrftoken	compare.rm.com	365 days, 0:00:00	HTTP	Helps prevent CSRF attacks
necessary	_cf_bm	vimeo.com	1 day, 0:00:00	HTTP	Used to distinguish between humans and bots
preferences	wtm	compare.rm.com	365 days, 0:00:00	HTTP	Used to store users cookie preference choices
statistics	_ga	rm.com	365 days, 0:00:00	HTTP	Registers a unique ID used to generate statistical data on how visitor used the website
statistics	_ga_#	rm.com	365 days, 0:00:00	HTTP	Used by Google Analytics to collect data on user visits to the website
statistics	_hp2_#	rm.com	1 day, 0:00:00	HTTP	Collects data on the user's navigation and behaviour on the website
statistics	_hp2_id.#	rm.com	365 days, 0:00:00	HTTP	Collects data on the user's navigation and behaviour on the website
statistics	_hp2_ses_props.#	rm.com	1 day, 0:00:00	HTTP	Collects data on the user's navigation and behaviour on the website
statistics	vuid	vimeo.com	365 days, 0:00:00	HTTP	Collects data on the user's visits to the website
marketing	td	googletagmanager.com	0:00:00	HTTP	Used by Google Tag Manager to collect data on the user behaviour and interaction with the website
marketing	h	heapanalytics.com	0:00:00	HTTP	Collects data on the user behaviour and interaction with the website

Name	Domain	Purpose	Expiration	Security
csrftoken	compare.rm.com	Helps prevent CSRF attacks	365 days, 0:00:00	HTTP
_cf_bm	vimeo.com	Used to distinguish between humans and bots	1 day, 0:00:00	HTTP

Name	Domain	Purpose	Expiration	Security
_ga	rm.com	Registers a unique ID used to generate statistical data on how visitor used the website	365 days, 0:00:00	HTTP
_ga_#	rm.com	Used by Google Analytics to collect data on user visits to the website	365 days, 0:00:00	HTTP
_hp2_#	rm.com	Collects data on the user's navigation and behaviour on the website	1 day, 0:00:00	HTTP
_hp2_id.#	rm.com	Collects data on the user's navigation and behaviour on the website	365 days, 0:00:00	HTTP
_hp2_ses_props.#	rm.com	Collects data on the user's navigation and behaviour on the website	1 day, 0:00:00	HTTP
vuid	vimeo.com	Collects data on the user's visits to the website	365 days, 0:00:00	HTTP

Name	Domain	Purpose	Expiration	Security
td	googletagmanager.com	Used by Google Tag Manager to collect data on the user behaviour and interaction with the website	0:00:00	HTTP
h	heapanalytics.com	Collects data on the user behaviour and interaction with the website	0:00:00	HTTP