Publication Date
  In 2025: 4
  Since 2024: 4
  Since 2021 (last 5 years): 12
  Since 2016 (last 10 years): 38
  Since 2006 (last 20 years): 77
Descriptor
  Cutting Scores: 148
  Test Items: 148
  Difficulty Level: 52
  Standard Setting (Scoring): 46
  Test Construction: 43
  Item Response Theory: 39
  Item Analysis: 36
  Test Validity: 25
  Licensing Examinations…: 23
  Error of Measurement: 21
  Foreign Countries: 19
Education Level
  Higher Education: 14
  Postsecondary Education: 13
  Secondary Education: 12
  Elementary Secondary Education: 10
  Junior High Schools: 6
  Middle Schools: 6
  Elementary Education: 5
  Grade 5: 5
  Grade 3: 3
  Grade 8: 3
  High Schools: 3
Location
  Arkansas: 2
  California: 2
  Germany: 2
  New Mexico: 2
  Turkey: 2
  Canada: 1
  China: 1
  Europe: 1
  European Union: 1
  Jordan: 1
  Maryland: 1
Laws, Policies, & Programs
  No Child Left Behind Act 2001: 2
  Education Consolidation…: 1
Abdolvahab Khademi; Craig S. Wells; Maria Elena Oliveri; Ester Villalonga-Olives – SAGE Open, 2023
The most common effect sizes when using a multiple-group confirmatory factor analysis approach to measurement invariance are ΔCFI and ΔTLI with a cutoff value of 0.01. However, this recommended cutoff value may not be ubiquitously appropriate and may be of limited application for some tests (e.g., measures using dichotomous items or…
Descriptors: Factor Analysis, Factor Structure, Error of Measurement, Test Items
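The 0.01 cutoff discussed in this entry is applied to the change in fit between nested invariance models: invariance is retained when adding equality constraints worsens CFI by no more than the cutoff. A minimal sketch of that decision rule (the fit values below are hypothetical, not from the study):

```python
def invariance_holds(cfi_less_constrained, cfi_more_constrained, cutoff=0.01):
    """Flag measurement invariance as tenable when CFI worsens by no more
    than `cutoff` after adding equality constraints (the ΔCFI rule)."""
    delta_cfi = cfi_less_constrained - cfi_more_constrained
    return delta_cfi <= cutoff

# Hypothetical CFI values for a configural vs. a metric invariance model
print(invariance_holds(0.962, 0.957))  # ΔCFI = 0.005 -> True
print(invariance_holds(0.962, 0.945))  # ΔCFI = 0.017 -> False
```

In practice the two CFI values would come from fitting nested multiple-group models in a CFA package; only the comparison rule is shown here.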
Jerin Kim; Kent McIntosh – Journal of Positive Behavior Interventions, 2025
We aimed to identify empirically valid cut scores on the positive behavioral interventions and supports (PBIS) Tiered Fidelity Inventory (TFI) through an expert panel process known as bookmarking. The TFI is a measurement tool to evaluate the fidelity of implementation of PBIS. In the bookmark method, experts reviewed all TFI items and item scores…
Descriptors: Positive Behavior Supports, Cutting Scores, Fidelity, Program Evaluation
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022
This article discusses visual techniques for detecting test items that would be optimal to select for the final compilation and, conversely, for screening out items that would lower the quality of the compilation. Some classic visual tools are discussed, first, in a practical manner in diagnosing the logical,…
Descriptors: Test Items, Item Analysis, Item Response Theory, Cutting Scores
Gilber Chura-Quispe; Cristina Beatriz Flores-Rosado; Alex Alfredo Valenzuela-Romero; Enlil Iván Herrera-Pérez; Avenilda Eufemia Herrera-Chura; Mercedes Alejandrina Collazos Alarcón – Contemporary Educational Technology, 2025
Information literacy is a fundamental component in the academic development of future professionals. The aim of the study was to evaluate the metric properties of the 'questionnaire of self-perceived information competences', analyzing the factorial structure, internal consistency, convergent validity, factorial invariance according to gender and…
Descriptors: Information Literacy, College Students, Student Attitudes, Foreign Countries
Lewis, Jennifer; Lim, Hwanggyu; Padellaro, Frank; Sireci, Stephen G.; Zenisky, April L. – Educational Measurement: Issues and Practice, 2022
Setting cut scores on multistage adaptive tests (MSTs) is difficult, particularly when the test spans several grade levels, and the selection of items from MST panels must reflect the operational test specifications. In this study, we describe, illustrate, and evaluate three methods for mapping panelists' Angoff ratings into cut scores on the scale underlying an MST. The…
Descriptors: Cutting Scores, Adaptive Testing, Test Items, Item Analysis
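The Angoff ratings that entries like this one start from have a simple core computation: each panelist estimates, per item, the probability that a minimally competent examinee answers correctly, and the raw cut score is the sum of the item-level mean ratings. A minimal sketch with hypothetical ratings (the mapping onto an MST scale studied in the paper is a further step not shown here):

```python
def angoff_cut_score(ratings):
    """ratings: one list of per-item probability ratings per panelist.
    Returns the raw-score cut: the sum over items of the mean rating."""
    n_panelists = len(ratings)
    n_items = len(ratings[0])
    item_means = [
        sum(panelist[i] for panelist in ratings) / n_panelists
        for i in range(n_items)
    ]
    return sum(item_means)

# Three hypothetical panelists rating a four-item test
ratings = [
    [0.6, 0.8, 0.5, 0.9],
    [0.7, 0.7, 0.4, 0.8],
    [0.5, 0.9, 0.6, 1.0],
]
print(round(angoff_cut_score(ratings), 2))  # 2.8 raw-score points
```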
Peter A. Edelsbrunner; Bianca A. Simonsmeier; Michael Schneider – Educational Psychology Review, 2025
Knowledge is an important predictor and outcome of learning and development. Its measurement is challenged by the fact that knowledge can be integrated and homogeneous, or fragmented and heterogeneous, which can change through learning. These characteristics of knowledge are at odds with current standards for test development, demanding a high…
Descriptors: Meta Analysis, Predictor Variables, Learning Processes, Knowledge Level
Skaggs, Gary; Hein, Serge F.; Wilkins, Jesse L. M. – Educational Measurement: Issues and Practice, 2020
In test-centered standard-setting methods, borderline performance can be represented by many different profiles of strengths and weaknesses. As a result, asking panelists to estimate item or test performance for a hypothetical group of borderline examinees, or a typical borderline examinee, may be an extremely difficult task and one that can…
Descriptors: Standard Setting (Scoring), Cutting Scores, Testing Problems, Profiles
Yoo Jeong Jang – ProQuest LLC, 2022
Despite the increasing demand for diagnostic information, observed subscores have often been reported to lack adequate psychometric qualities such as reliability, distinctiveness, and validity. Therefore, several statistical techniques based on CTT and IRT frameworks have been proposed to improve the quality of subscores. More recently, DCM has…
Descriptors: Classification, Accuracy, Item Response Theory, Correlation
Wolkowitz, Amanda A.; Foley, Brett; Zurn, Jared – Practical Assessment, Research & Evaluation, 2023
The purpose of this study is to introduce a method for converting scored 4-option multiple-choice (MC) items into scored 3-option MC items without re-pretesting the 3-option MC items. This study describes a six-step process for achieving this goal. Data from a professional credentialing exam were used in this study, and the method was applied to 24…
Descriptors: Multiple Choice Tests, Test Items, Accuracy, Test Format
Lewis, Daniel; Cook, Robert – Educational Measurement: Issues and Practice, 2020
In this paper we assert that the practice of principled assessment design renders traditional standard-setting methodology redundant at best and contradictory at worst. We describe the rationale for, and methodological details of, Embedded Standard Setting (ESS; previously Engineered Cut Scores; Lewis, 2016), an approach to establishing performance…
Descriptors: Standard Setting, Evaluation, Cutting Scores, Performance Based Assessment
Furter, Robert T.; Dwyer, Andrew C. – Applied Measurement in Education, 2020
Maintaining equivalent performance standards across forms is a psychometric challenge exacerbated by small samples. In this study, the accuracy of two equating methods (Rasch anchored calibration and nominal weights mean) and four anchor item selection methods was investigated in the context of very small samples (N = 10). Overall, nominal…
Descriptors: Classification, Accuracy, Item Response Theory, Equated Scores
Rümeysa Kaya; Bayram Çetin – International Journal of Assessment Tools in Education, 2025
In this study, the cut-off scores obtained from the Angoff, Angoff Y/N, Nedelsky, and Ebel standard-setting methods were compared with a T score of 50 and the current cut-off score in various respects. Data were collected from 448 students who took Module B1+ English Exit Exam IV and 14 experts. It was seen that while the Nedelsky method gave the lowest…
Descriptors: Standard Setting, Cutting Scores, Exit Examinations, Academic Achievement
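Of the methods compared in this entry, the Nedelsky procedure has the most compact formula: for each item, a judge counts the answer options a minimally competent examinee could not rule out; the item's expected score is the reciprocal of that count, and the cut score is the sum across items. A minimal sketch with hypothetical judgments:

```python
def nedelsky_cut_score(remaining_options):
    """remaining_options: per item, the number of answer options a
    minimally competent examinee could NOT eliminate. The expected score
    on each item is 1/k (random guess among the rest); the cut is the sum."""
    return sum(1.0 / k for k in remaining_options)

# Hypothetical 4-option items; the judge leaves 2, 4, 1, and 3 options in play
print(round(nedelsky_cut_score([2, 4, 1, 3]), 3))  # 0.5 + 0.25 + 1 + 1/3
```

With multiple judges, the per-judge cut scores would typically be averaged; only the single-judge computation is shown.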
Kara, Hakan; Cetin, Sevda – International Journal of Assessment Tools in Education, 2020
In this study, the efficiency of various random sampling methods for reducing the number of items rated by judges in an Angoff standard-setting study was examined, and the methods were compared with each other. First, the full-length test was formed by combining the Placement Test 2012 and 2013 mathematics subsets. Then, simple random sampling…
Descriptors: Cutting Scores, Standard Setting (Scoring), Sampling, Error of Measurement
Parry, James R. – Online Submission, 2020
This paper presents research and provides a method to ensure that parallel assessments generated from a large test-item database maintain equitable difficulty and content coverage each time the assessment is presented. To maintain fairness and validity, it is important that all instances of an assessment that is intended to test the…
Descriptors: Culture Fair Tests, Difficulty Level, Test Items, Test Validity
Bramley, Tom – Research Matters, 2020
The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy of a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…
Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy
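Chained linear equating, the small-sample method this entry compares against, links form X to form Y through a common anchor V: scores on X are mapped to the anchor scale using group 1's statistics, then from the anchor to Y using group 2's, each step being a linear (match mean and SD) transformation. A minimal sketch with hypothetical score data:

```python
import statistics

def linear_link(x, mean_from, sd_from, mean_to, sd_to):
    """Linear equating: match the means and standard deviations of two scales."""
    return mean_to + (sd_to / sd_from) * (x - mean_from)

def chained_linear(x, g1_x, g1_v, g2_v, g2_y):
    """Chain X -> V (group 1 statistics) -> Y (group 2 statistics).
    Each argument after x is a list of observed scores."""
    v = linear_link(x, statistics.mean(g1_x), statistics.pstdev(g1_x),
                    statistics.mean(g1_v), statistics.pstdev(g1_v))
    return linear_link(v, statistics.mean(g2_v), statistics.pstdev(g2_v),
                       statistics.mean(g2_y), statistics.pstdev(g2_y))

# Hypothetical tiny samples: group 1 took X and anchor V; group 2 took V and Y
print(chained_linear(13, [10, 12, 14], [5, 6, 7], [4, 6, 8], [20, 25, 30]))  # ≈ 26.25
```

Real applications would use far larger samples and often smooth or weight the moments; the point here is only the two-step chain itself.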