ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	10
Since 2016 (last 10 years)	21
Since 2006 (last 20 years)	59

Descriptor

Difficulty Level	82
Test Items	82
Test Construction	32
Models	20
Item Response Theory	19
Item Analysis	17
Foreign Countries	15
Computer Assisted Testing	14
Multiple Choice Tests	14
Student Evaluation	13
Evaluation Methods	12
Test Validity	12
Psychometrics	11
Test Bias	11
Comparative Analysis	9
Mathematics Tests	9
Test Reliability	9
Computation	8
Grade 4	8
Item Banks	8
Scoring	8
Adaptive Testing	7
Cognitive Processes	7
Higher Education	7
Reading Tests	7
More ▼

Publication Type

Reports - Descriptive	82
Journal Articles	58
Speeches/Meeting Papers	6
Numerical/Quantitative Data	3
Tests/Questionnaires	3
Computer Programs	2
Collected Works - Serials	1

Education Level

Higher Education	14
Postsecondary Education	11
Elementary Education	10
Grade 4	10
Elementary Secondary Education	7
Grade 8	7
Middle Schools	6
Intermediate Grades	5
Secondary Education	5
High Schools	4
Junior High Schools	4
Grade 12	3
Grade 3	3
Grade 6	3
Grade 1	2
Grade 2	2
Grade 5	2
Grade 7	2
Kindergarten	2
Early Childhood Education	1
Primary Education	1
More ▼

Audience

Teachers	4
Policymakers	3
Practitioners	1

Location

Canada	2
Australia	1
Austria	1
Belgium	1
California	1
Florida	1
Greece	1
Ireland	1
Japan	1
Norway	1
Saudi Arabia	1
Tennessee	1
Turkey	1
United Kingdom (Scotland)	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	4
Test of English as a Foreign…	2
Test of English for…	2
Florida Comprehensive…	1
Graduate Record Examinations	1
Program for International…	1
Raven Advanced Progressive…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 82 results Save | Export

Parameters and Models of Item Response Theory (IRT): A Review of Literature

Peer reviewed

Direct link

Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023

Introduction: Item response theory (IRT) has received much attention in validation of assessment instrument because it allows the estimation of students' ability from any set of the items. Item response theory allows the difficulty and discrimination levels of each item on the test to be estimated. In the framework of IRT, item characteristics are…

Descriptors: Item Response Theory, Models, Test Items, Difficulty Level

On the Positive Correlation between DIF and Difficulty: A New Theory on the Correlation as Methodological Artifact

Peer reviewed

Direct link

Bolt, Daniel M.; Liao, Xiangyi – Journal of Educational Measurement, 2021

We revisit the empirically observed positive correlation between DIF and difficulty studied by Freedle and commonly seen in tests of verbal proficiency when comparing populations of different mean latent proficiency levels. It is shown that a positive correlation between DIF and difficulty estimates is actually an expected result (absent any true…

Descriptors: Test Bias, Difficulty Level, Correlation, Verbal Tests

Using Full-Information Item Analysis to Improve Item Quality

Peer reviewed

Direct link

Haladyna, Thomas M.; Rodriguez, Michael C. – Educational Assessment, 2021

Full-information item analysis provides item developers and reviewers comprehensive empirical evidence of item quality, including option response frequency, point-biserial index (PBI) for distractors, mean-scores of respondents selecting each option, and option trace lines. The multi-serial index (MSI) is introduced as a more informative…

Descriptors: Test Items, Item Analysis, Reading Tests, Mathematics Tests

Meeting Students Where They Are: Using Rasch Modeling for Improving the Measurement of Active Research in Higher Education

Peer reviewed

Direct link

Dahl, Laura S.; Staples, B. Ashley; Mayhew, Matthew J.; Rockenbach, Alyssa N. – Innovative Higher Education, 2023

Surveys with rating scales are often used in higher education research to measure student learning and development, yet testing and reporting on the longitudinal psychometric properties of these instruments is rare. Rasch techniques allow scholars to map item difficulty and individual aptitude on the same linear, continuous scale to compare…

Descriptors: Surveys, Rating Scales, Higher Education, Educational Research

Teacher-Made Tests: Why They Matter and a Framework for Analysing Mathematics Exams

Peer reviewed

Direct link

Wellberg, Sarah – Assessment in Education: Principles, Policy & Practice, 2023

Classroom assessment research in the United States has shifted away from the examination of teacher-made tests, but such tests are still widely used and have an enormous impact on students' educational experiences. Given the major shifts in educational policy in the United States, including the widespread adoption of the Common Core State…

Descriptors: Teacher Made Tests, Mathematics Tests, Common Core State Standards, Test Items

Analyzing and Visualizing Learning Data: A System Designer's Perspective

Peer reviewed
PDF on ERIC

Download full text

Pelanek, Radek – Journal of Learning Analytics, 2021

In this work, we consider learning analytics for primary and secondary schools from the perspective of the designer of a learning system. We provide an overview of practically useful analytics techniques with descriptions of their applications and specific illustrations. We highlight data biases and caveats that complicate the analysis and its…

Descriptors: Learning Analytics, Elementary Schools, Secondary Schools, Educational Technology

A Framework to Evaluate Cognitive Complexity in Mathematics Assessments

Download full text

Achieve, Inc., 2019

In 2013, the Council of Chief State School Officers (CCSSO), working collaboratively with state education agencies, released a set of criteria for states to use to evaluate and procure high-quality assessments. The mathematics section of the document included five content-specific criteria to evaluate alignment of assessments to college- and…

Descriptors: Mathematics Tests, Difficulty Level, Evaluation Criteria, Cognitive Processes

On Joining a Signal Detection Choice Model with Response Time Models

Peer reviewed

Direct link

DeCarlo, Lawrence T. – Journal of Educational Measurement, 2021

In a signal detection theory (SDT) approach to multiple choice exams, examinees are viewed as choosing, for each item, the alternative that is perceived as being the most plausible, with perceived plausibility depending in part on whether or not an item is known. The SDT model is a process model and provides measures of item difficulty, item…

Descriptors: Perception, Bias, Theories, Test Items

Question Banks for Effective Online Assessments in Introductory Science Courses

Peer reviewed

Direct link

Krzic, Maja; Brown, Sandra – Natural Sciences Education, 2022

The transition of our large ([approximately]300 student) introductory soil science course to the online setting created several challenges, including engaging first- and second-year students, providing meaningful hands-on learning activities, and setting up online exams. The objective of this paper is to describe the development and use of…

Descriptors: Introductory Courses, Social Sciences, Online Courses, Educational Change

Item Order and Speededness: Implications for Test Fairness in Higher Educational High-Stakes Testing

Peer reviewed

Direct link

Becker, Benjamin; van Rijn, Peter; Molenaar, Dylan; Debeer, Dries – Assessment & Evaluation in Higher Education, 2022

A common approach to increase test security in higher educational high-stakes testing is the use of different test forms with identical items but different item orders. The effects of such varied item orders are relatively well studied, but findings have generally been mixed. When multiple test forms with different item orders are used, we argue…

Descriptors: Information Security, High Stakes Tests, Computer Security, Test Items

Sustaining an Occupation-Specific Language Assessment for the Canadian Healthcare Field

Peer reviewed
PDF on ERIC

Download full text

Stewart, Gail; Strachan, Andrea – TESL Canada Journal, 2022

Since its implementation in 2004, the Canadian English Language Benchmark Assessment for Nurses (CELBAN) has been accepted as evidence of language ability for licensure of internationally educated nurses (IENs) in Canada. This article focuses on the complexities of sustaining an occupation-specific assessment over time. The authors reference the…

Descriptors: Language Tests, English for Special Purposes, Benchmarking, Nurses

On the Choice of Anchor Tests in Equating

Peer reviewed

Direct link

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018

The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…

Descriptors: Test Content, Difficulty Level, Test Items, Test Construction

Automated Test Paper Generation Using Utility Based Agent and Shuffling Algorithm

Peer reviewed

Direct link

El Rahman, Sahar Abd; Zolait, Ali Hussein – International Journal of Web-Based Learning and Teaching Technologies, 2019

This article describes how with the advent of computer-based technology, there is movement from manual to automated systems for different aspects of the education system. Testing is an essential part of teaching process that helps academics in classifying the level of students and evaluating the outcomes of their teaching process. The testing…

Descriptors: Test Items, Computer Uses in Education, Computers, Web Based Instruction

Toward Modeling the Intrinsic Complexity of Test Problems

Peer reviewed

Direct link

Shoufan, Abdulhadi – IEEE Transactions on Education, 2017

The concept of intrinsic complexity explains why different problems of the same type, tackled by the same problem solver, can require different times to solve and yield solutions of different quality. This paper proposes a general four-step approach that can be used to establish a model for the intrinsic complexity of a problem class in terms of…

Descriptors: Test Items, Difficulty Level, Problem Solving, Models

A Framework to Evaluate Cognitive Complexity in Science Assessments

Download full text

Achieve, Inc., 2019

Assessment is a key lever for educational improvement. Assessments can be used to monitor, signal, and influence science teaching and learning -- provided that they are of high quality, reflect the rigor and intent of academic standards, and elicit meaningful student performances. Since the release of "A Framework for K-12 Science…

Descriptors: Difficulty Level, Evaluation Criteria, Cognitive Processes, Test Items

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Journal of Educational…	6
Behavioral Research and…	4
Journal of Chemical Education	4
Journal of Educational and…	4
Educational and Psychological…	3
National Assessment Governing…	3
Psychometrika	3
Achieve, Inc.	2
Applied Psychological…	2
Educational Technology &…	2
Journal of Computer Assisted…	2
Journal of University…	2
Online Submission	2
Practical Assessment,…	2
Acta Educationis Generalis	1
American Biology Teacher	1
Applied Measurement in…	1
Assessment & Evaluation in…	1
Assessment in Education:…	1
College Entrance Examination…	1
Collegiate Microcomputer	1
Communique	1
Computers and Education	1
Education Digest: Essential…	1
Education Week	1
More ▼

Tindal, Gerald	4
Alonzo, Julie	3
Kubinger, Klaus D.	3
Camilli, Gregory	2
Cawthon, Stephanie W.	2
De Boeck, Paul	2
Linacre, John M.	2
Liu, Kimy	2
Prowker, Adam	2
Revuelta, Javier	2
Acquaye, Rosemary	1
Al-A'ali, Mansoor	1
Andrich, David	1
Arth, Thomas O.	1
Baghaei, Purya	1
Batchelder, William H.	1
Bechger, Timo M.	1
Becker, Benjamin	1
Belur, Madhu N.	1
Benderson, Albert, Ed.	1
Beretvas, S. Natasha	1
Bolt, Daniel M.	1
Brown, Sandra	1
Chaporkar, Prasanna	1
Chatzopoulou, D. I.	1
More ▼