ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	3
Since 2007 (last 20 years)	5

Descriptor

Quality Control	6
Test Construction	3
Computer Assisted Testing	2
Equated Scores	2
Error Correction	2
Evaluation Methods	2
Scoring	2
Teacher Role	2
Academic Achievement	1
Academic Standards	1
Accountability	1
Artificial Intelligence	1
Automation	1
Change	1
Check Lists	1
Classification	1
Cloze Procedure	1
College Entrance Examinations	1
Data	1
Data Analysis	1
Error Patterns	1
Guidance Programs	1
Improvement Programs	1
Instructional Materials	1
Learning Modules	1

Source

Educational Measurement:…

Author

Allalouf, Avi	2
Baumer, Michal	1
Carragher, Natacha	1
Dorans, Neil J.	1
Guher Gorgun	1
Gutentag, Tony	1
Jones, Phillip	1
Liu, Jinghua	1
Okan Bulut	1
Roschewski, Pat	1
Shulruf, Boaz	1
Templin, Jonathan	1
Velan, Gary	1

Publication Type

Journal Articles	6
Reports - Descriptive	5
Reports - Research	1

Location

Nebraska

Assessments and Surveys

SAT (College Admission Test)

Showing all 6 results

Instruction-Tuned Large-Language Models for Quality Control in Automatic Item Generation: A Feasibility Study

Peer reviewed

Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025

Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality persists to be a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…

Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation

Digital Module 04: Diagnostic Measurement: Modeling Checklists for Practitioners https://ncme.elevate.commpartners.com

Peer reviewed

Carragher, Natacha; Templin, Jonathan; Jones, Phillip; Shulruf, Boaz; Velan, Gary – Educational Measurement: Issues and Practice, 2019

In this ITEMS module, we provide a didactic overview of the specification, estimation, evaluation, and interpretation steps for diagnostic measurement/classification models (DCMs), which are a promising psychometric modeling approach. These models can provide detailed skill- or attribute-specific feedback to respondents along multiple latent…

Descriptors: Measurement, Classification, Models, Check Lists

Quality Control for Scoring Tests Administered in Continuous Mode: An NCME Instructional Module

Peer reviewed

Allalouf, Avi; Gutentag, Tony; Baumer, Michal – Educational Measurement: Issues and Practice, 2017

Quality control (QC) in testing is paramount. QC procedures for tests can be divided into two types. The first type, one that has been well researched, is QC for tests administered to large population groups on few administration dates using a small set of test forms (e.g., large-scale assessment). The second type is QC for tests, usually…

Descriptors: Quality Control, Scoring, Computer Assisted Testing, Error Patterns

Assessing a Critical Aspect of Construct Continuity when Test Specifications Change or Test Forms Deviate from Specifications

Peer reviewed

Liu, Jinghua; Dorans, Neil J. – Educational Measurement: Issues and Practice, 2013

We make a distinction between two types of test changes: inevitable deviations from specifications versus planned modifications of specifications. We describe how score equity assessment (SEA) can be used as a tool to assess a critical aspect of construct continuity, the equivalence of scores, whenever planned changes are introduced to testing…

Descriptors: Tests, Test Construction, Test Format, Change

An NCME Instructional Module on Quality Control Procedures in the Scoring, Equating, and Reporting of Test Scores

Peer reviewed

Allalouf, Avi – Educational Measurement: Issues and Practice, 2007

There is significant potential for error in long production processes that consist of sequential stages, each of which is heavily dependent on the previous stage, such as the SER (Scoring, Equating, and Reporting) process. Quality control procedures are required in order to monitor this process and to reduce the number of mistakes to a minimum. In…

Descriptors: Scoring, Quality Control, Sequential Approach, Error Correction

History and Background of Nebraska's School-Based Teacher-Led Assessment and Reporting System (STARS)

Peer reviewed

Roschewski, Pat – Educational Measurement: Issues and Practice, 2004

Nebraska's approach to standards, assessment, and accountability, the School-based Teacher-led Assessment and Reporting System (STARS) is based upon local control and the belief that classrooms and teachers must be at the heart of student learning and accountability. STARS relies on locally-developed assessment systems to accurately measure and…

Descriptors: Accountability, Student Evaluation, Evaluation Methods, Teacher Role