Showing 1 to 15 of 69 results
Peer reviewed
Direct link
Tia M. Fechter; Heeyeon Yoon – Language Testing, 2024
This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent…
Descriptors: Standard Setting, Language Proficiency, Language Tests, Evaluation Methods
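The Yes/No Angoff procedure this study modifies has a simple numerical core: each panelist judges, item by item, whether a minimally competent examinee would answer correctly, and the cut score follows from those yes/no marks. A minimal sketch under that standard description (names and data are ours, not the study's):

```python
def yes_no_angoff_cut(ratings: list[list[int]]) -> float:
    """ratings[p][i] = 1 if panelist p judges that a borderline
    examinee would answer item i correctly, else 0. Each panelist's
    implied cut score is their count of "yes" marks; the panel's
    recommendation is the mean of those counts."""
    panelist_cuts = [sum(r) for r in ratings]
    return sum(panelist_cuts) / len(panelist_cuts)

# Three panelists rating a five-item test:
ratings = [
    [1, 1, 0, 1, 0],  # panelist A -> implied cut of 3
    [1, 0, 0, 1, 1],  # panelist B -> implied cut of 3
    [1, 1, 1, 1, 0],  # panelist C -> implied cut of 4
]
print(yes_no_angoff_cut(ratings))  # 10/3 ≈ 3.33 raw-score cut
```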
Peer reviewed
Direct link
Langbeheim, Elon; Akaygun, Sevil; Adadan, Emine; Hlatshwayo, Manzini; Ramnarain, Umesh – International Journal of Science and Mathematics Education, 2023
Linking assessment and curriculum in science education, particularly within the topic of matter and its changes, is often taken for granted. Some of the fundamental elements of the assessment, such as the choice of wording and visual representations, as well as its relation to the curricular sequence, remain understudied. In addition, very few…
Descriptors: Student Evaluation, Evaluation Methods, Science Education, Test Items
Alicia A. Stoltenberg – ProQuest LLC, 2024
Multiple-select multiple-choice items, or multiple-choice items with more than one correct answer, are used to quickly assess content on standardized assessments. Because there are multiple keys to these item types, there are also multiple ways to score student responses to these items. The purpose of this study was to investigate how changing the…
Descriptors: Scoring, Evaluation Methods, Multiple Choice Tests, Standardized Tests
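Two of the standard ways to score a multiple-select item, which a study like this would compare, can be sketched as follows (illustrative code and names, not the dissertation's actual scoring rules):

```python
def score_all_or_nothing(selected: set, key: set) -> float:
    """Dichotomous scoring: full credit only if the examinee's
    selections match the key exactly."""
    return 1.0 if selected == key else 0.0

def score_partial_credit(selected: set, key: set, options: set) -> float:
    """Per-option partial credit: credit each option the examinee
    classifies correctly (marked if keyed, unmarked if not),
    scaled to the 0..1 range."""
    correct = sum((opt in selected) == (opt in key) for opt in options)
    return correct / len(options)

options = {"A", "B", "C", "D"}
key = {"A", "C"}
response = {"A", "B"}  # one hit (A), one false alarm (B), one miss (C)
print(score_all_or_nothing(response, key))           # 0.0
print(score_partial_credit(response, key, options))  # 0.5 (A and D classified correctly)
```

Because the two rules can rank the same responses differently, the choice of rule changes item statistics and, downstream, examinee scores.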
Peer reviewed
Direct link
Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2020
A common belief is that the Bookmark method is a cognitively simpler standard-setting method than the modified Angoff method. However, little research has investigated panelists' ability to perform the Bookmark method well, or whether some of the challenges panelists face with the Angoff method may also be present in the Bookmark…
Descriptors: Standard Setting (Scoring), Evaluation Methods, Testing Problems, Test Items
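In the Bookmark method, panelists place a bookmark in a booklet of items ordered by difficulty, and the placement is mapped to a cut score through a response-probability criterion. A Rasch-based sketch of that mapping (operational details vary across programs; this is our illustration, not the authors'):

```python
import math

def bookmark_cut_theta(b_sorted: list[float], bookmark_page: int,
                       rp: float = 0.67) -> float:
    """Map a bookmark placement to a cut score on the theta scale.
    Items are ordered easiest-first by Rasch difficulty b; the
    bookmark after page k means the borderline examinee should
    answer the k-th item correctly with probability rp (often 0.67).
    Solving rp = 1 / (1 + exp(-(theta - b))) for theta gives:"""
    b = b_sorted[bookmark_page - 1]
    return b + math.log(rp / (1 - rp))

b_sorted = [-1.2, -0.3, 0.4, 1.1]       # item difficulties, easiest first
print(bookmark_cut_theta(b_sorted, 3))  # 0.4 + ln(0.67/0.33) ≈ 1.11
```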
Peer reviewed
Direct link
Qian Liu; Navé Wald; Chandima Daskon; Tony Harland – Innovations in Education and Teaching International, 2024
This qualitative study looks at multiple-choice questions (MCQs) in examinations and their effectiveness in testing higher-order cognition. While there are claims that MCQs can do this, we consider many assertions problematic because of the difficulty in interpreting what higher-order cognition consists of and whether or not assessment tasks…
Descriptors: Multiple Choice Tests, Critical Thinking, College Faculty, Student Evaluation
Achieve, Inc., 2019
In 2013, the Council of Chief State School Officers (CCSSO), working collaboratively with state education agencies, released a set of criteria for states to use to evaluate and procure high-quality assessments. The mathematics section of the document included five content-specific criteria to evaluate alignment of assessments to college- and…
Descriptors: Mathematics Tests, Difficulty Level, Evaluation Criteria, Cognitive Processes
Peer reviewed
PDF on ERIC
Kaya Uyanik, Gulden; Demirtas Tolaman, Tugba; Gur Erdogan, Duygu – International Journal of Assessment Tools in Education, 2021
This paper examines and assesses the questions included in the "Turkish Common Exam" for sixth graders, held in the first semester of 2018 and one of the common exams carried out by the Measurement and Evaluation Centers, in terms of question structure, quality, and taxonomic value. To this end, the test questions were examined…
Descriptors: Foreign Countries, Grade 6, Standardized Tests, Test Items
Parry, James R. – Online Submission, 2020
This paper presents research and provides a method to ensure that parallel assessments generated from a large test-item database maintain equitable difficulty and content coverage each time the assessment is presented. To maintain fairness and validity, it is important that all instances of an assessment intended to test the…
Descriptors: Culture Fair Tests, Difficulty Level, Test Items, Test Validity
Peer reviewed
PDF on ERIC
Setiawan, Risky – European Journal of Educational Research, 2019
The purposes of this research are: 1) to compare two test-equating procedures conducted with the Haebara and Stocking-Lord methods; 2) to describe the characteristics of each equating method using the IRTEQ program for Windows. This research employs a participatory approach, as the data are collected through questionnaires based on the National Examination…
Descriptors: Equated Scores, Evaluation Methods, Evaluation Criteria, Test Items
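The two linking methods compared above differ only in the loss function used to choose the scale-transformation constants A and B. A hedged sketch under a 2PL model (our illustration, not IRTEQ's implementation): Haebara matches item characteristic curves item by item, while Stocking-Lord matches the summed test characteristic curve.

```python
import math

def p_2pl(theta: float, a: float, b: float) -> float:
    """2PL item characteristic curve."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def haebara_loss(A, B, items_base, items_new, thetas):
    """Sum of squared per-item ICC differences after transforming
    new-form parameters (a/A, A*b + B) onto the base scale."""
    loss = 0.0
    for t in thetas:
        for (a0, b0), (a1, b1) in zip(items_base, items_new):
            loss += (p_2pl(t, a0, b0) - p_2pl(t, a1 / A, A * b1 + B)) ** 2
    return loss

def stocking_lord_loss(A, B, items_base, items_new, thetas):
    """Squared difference of the summed test characteristic curves."""
    loss = 0.0
    for t in thetas:
        tcc_base = sum(p_2pl(t, a, b) for a, b in items_base)
        tcc_new = sum(p_2pl(t, a / A, A * b + B) for a, b in items_new)
        loss += (tcc_base - tcc_new) ** 2
    return loss

# Sanity check: an identity transformation on identical common
# items leaves both losses at zero.
items = [(1.0, 0.0), (1.3, 0.6), (0.8, -0.4)]
thetas = [-2, -1, 0, 1, 2]
print(haebara_loss(1.0, 0.0, items, items, thetas))        # 0.0
print(stocking_lord_loss(1.0, 0.0, items, items, thetas))  # 0.0
```

In practice A and B are found by minimizing one of these losses over a grid or with a numerical optimizer; because Stocking-Lord aggregates before squaring, item-level misfits can cancel, which is one source of the differences such studies report.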
Achieve, Inc., 2018
In 2013, the Council of Chief State School Officers (CCSSO), working collaboratively with state education agencies, released a set of criteria for states to use to evaluate and procure high-quality assessments. The mathematics section of the document included five content-specific criteria to evaluate alignment of assessments to college- and…
Descriptors: Mathematics Tests, Difficulty Level, Evaluation Criteria, Cognitive Processes
Peer reviewed
Direct link
Park, Sung Eun; Ahn, Soyeon; Zopluoglu, Cengiz – Educational and Psychological Measurement, 2021
This study presents a new approach to synthesizing differential item functioning (DIF) effect size: First, using correlation matrices from each study, we perform a multigroup confirmatory factor analysis (MGCFA) that examines measurement invariance of a test item between two subgroups (i.e., focal and reference groups). Then we synthesize, across…
Descriptors: Item Analysis, Effect Size, Difficulty Level, Monte Carlo Methods
Peer reviewed
Direct link
Wang, Xiaolin; Svetina, Dubravka; Dai, Shenghai – Journal of Experimental Education, 2019
Recently, interest in test subscore reporting for diagnosis purposes has been growing rapidly. The two simulation studies here examined factors (sample size, number of subscales, correlation between subscales, and three factors affecting subscore reliability: number of items per subscale, item parameter distribution, and data generating model)…
Descriptors: Value Added Models, Scores, Sample Size, Correlation
Achieve, Inc., 2019
Assessment is a key lever for educational improvement. Assessments can be used to monitor, signal, and influence science teaching and learning -- provided that they are of high quality, reflect the rigor and intent of academic standards, and elicit meaningful student performances. Since the release of "A Framework for K-12 Science…
Descriptors: Difficulty Level, Evaluation Criteria, Cognitive Processes, Test Items
Peer reviewed
Direct link
Holmes, Stephen D.; Meadows, Michelle; Stockford, Ian; He, Qingping – International Journal of Testing, 2018
The relationship of expected and actual difficulty of items on six mathematics question papers designed for 16-year-olds in England was investigated through paired comparison using experts and through testing with students. A variant of the Rasch model was applied to the comparison data to establish a scale of expected difficulty. In testing, the papers…
Descriptors: Foreign Countries, Secondary School Students, Mathematics Tests, Test Items
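One standard way to turn expert paired comparisons into a difficulty scale, as the entry above describes, is the Bradley-Terry model, which is algebraically a Rasch-type model for comparison data. A hedged sketch of fitting it by gradient ascent (our illustration with toy data, not the authors' implementation):

```python
import math

def p_harder(d_i: float, d_j: float) -> float:
    """Bradley-Terry/Rasch form: probability that item i is judged
    harder than item j, given scale values d."""
    return 1.0 / (1.0 + math.exp(-(d_i - d_j)))

def fit_difficulty_scale(wins: dict, n_items: int,
                         iters: int = 300, lr: float = 0.05) -> list:
    """wins[(i, j)] = number of times experts judged item i harder
    than item j. Gradient ascent on the Bradley-Terry
    log-likelihood, anchored so the scale values sum to zero."""
    d = [0.0] * n_items
    for _ in range(iters):
        grad = [0.0] * n_items
        for (i, j), w in wins.items():
            resid = w * (1.0 - p_harder(d[i], d[j]))
            grad[i] += resid   # judged harder -> push d[i] up
            grad[j] -= resid   # judged easier -> push d[j] down
        d = [x + lr * g for x, g in zip(d, grad)]
        mean = sum(d) / n_items
        d = [x - mean for x in d]  # fix the scale's origin
    return d

# Toy data: item 0 usually judged hardest, item 2 easiest.
wins = {(0, 1): 8, (1, 0): 2, (1, 2): 7, (2, 1): 3, (0, 2): 9, (2, 0): 1}
print(fit_difficulty_scale(wins, 3))  # descending scale values 0 > 1 > 2
```

The fitted scale of expected difficulty can then be compared against empirical difficulty (e.g., facility values) from student testing, which is the relationship the study investigates.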
Achieve, Inc., 2019
In 2013, the Council of Chief State School Officers (CCSSO), working collaboratively with state education agencies, released a set of criteria for states to use to evaluate and procure high-quality assessments. The English Language Arts (ELA)/Literacy section of the document included nine content-specific criteria to evaluate the alignment of…
Descriptors: Reading Skills, Student Evaluation, Evaluation Methods, Reading Tests