ERIC - Search Results

Publication Date

In 2025	6
Since 2024	17
Since 2021 (last 5 years)	93
Since 2016 (last 10 years)	214
Since 2006 (last 20 years)	400

Descriptor

Test Items	808
Scoring	627
Test Construction	234
Item Response Theory	168
Difficulty Level	146
Test Reliability	146
Item Analysis	137
Multiple Choice Tests	125
Test Validity	123
Foreign Countries	122
Computer Assisted Testing	119
Psychometrics	108
Test Format	106
Comparative Analysis	96
Standard Setting (Scoring)	93
Scores	91
Mathematics Tests	87
Achievement Tests	80
Higher Education	79
Student Evaluation	78
Testing	78
Models	73
Language Tests	68
Scoring Formulas	68
Evaluation Methods	66
More ▼

Education Level

Secondary Education	75
Higher Education	72
Elementary Education	60
Postsecondary Education	59
Elementary Secondary Education	43
Middle Schools	29
High Schools	27
Junior High Schools	26
Grade 8	20
Early Childhood Education	17
Grade 4	15
Grade 5	14
Intermediate Grades	14
Grade 6	13
Primary Education	12
Grade 3	10
Grade 7	10
Grade 2	7
Kindergarten	6
Grade 1	3
Grade 10	2
Grade 12	2
Grade 9	2
Adult Education	1
Grade 11	1
More ▼

Audience

Practitioners	41
Teachers	39
Researchers	18
Administrators	9
Students	9
Parents	6
Counselors	1
Policymakers	1

Location

Canada	16
China	13
Arizona	11
Australia	10
Turkey	10
United Kingdom	10
California	8
Florida	8
Japan	7
Pennsylvania	6
United States	6
Germany	5
Nebraska	5
Taiwan	5
Tennessee	5
Netherlands	4
New York	4
United Kingdom (England)	4
Denmark	3
Europe	3
France	3
Hong Kong	3
Israel	3
Louisiana	3
New Mexico	3
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	3
Comprehensive Education…	2
Elementary and Secondary…	2
Education Consolidation…	1
Education for All Handicapped…	1
Individuals with Disabilities…	1
Kentucky Education Reform Act…	1
National Defense Education Act	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Test Items X

Showing 1 to 15 of 808 results Save | Export

Scoring Running Records: Complexities and Affordances

Peer reviewed

Direct link

Rodgers, Emily; D'Agostino, Jerome V.; Berenbon, Rebecca; Johnson, Tracy; Winkler, Christa – Journal of Early Childhood Literacy, 2023

Running Records are thought to be an excellent formative assessment tool because they generate results that educators can use to make their teaching more responsive. Despite the technical nature of scoring Running Records and the kinds of important decisions that are attached to their analysis, few studies have investigated assessor accuracy. We…

Descriptors: Formative Evaluation, Scoring, Accuracy, Difficulty Level

Embedding Embedded Standard Setting: An Application of Cross-Classified Item Response Theory. CRESST Report 876

Download full text

Yun-Kyung Kim; Li Cai – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2025

This paper introduces an application of cross-classified item response theory (IRT) modeling to an assessment utilizing the embedded standard setting (ESS) method (Lewis & Cook). The cross-classified IRT model is used to treat both item and person effects as random, where the item effects are regressed on the target performance levels (target…

Descriptors: Standard Setting (Scoring), Item Response Theory, Test Items, Difficulty Level

Testing for Differential Item Functioning under the "D"-Scoring Method

Peer reviewed

Direct link

Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Educational and Psychological Measurement, 2022

This study offers an approach to testing for differential item functioning (DIF) in a recently developed measurement framework, referred to as "D"-scoring method (DSM). Under the proposed approach, called "P-Z" method of testing for DIF, the item response functions of two groups (reference and focal) are compared by…

Descriptors: Test Bias, Methods, Test Items, Scoring

Examination of the Aggregate Scoring Method in a Judgment Concordance Test

Peer reviewed
PDF on ERIC

Download full text

Deschênes, Marie-France; Dionne, Éric; Dorion, Michelle; Grondin, Julie – Practical Assessment, Research & Evaluation, 2023

The use of the aggregate scoring method for scoring concordance tests requires the weighting of test items to be derived from the performance of a group of experts who take the test under the same conditions as the examinees. However, the average score of experts constituting the reference panel remains a critical issue in the use of these tests.…

Descriptors: Scoring, Tests, Evaluation Methods, Test Items

An Examination of Individual Ability Estimation and Classification Accuracy under Rapid Guessing Misidentifications

Peer reviewed

Direct link

Rios, Joseph – Applied Measurement in Education, 2022

To mitigate the deleterious effects of rapid guessing (RG) on ability estimates, several rescoring procedures have been proposed. Underlying many of these procedures is the assumption that RG is accurately identified. At present, there have been minimal investigations examining the utility of rescoring approaches when RG is misclassified, and…

Descriptors: Accuracy, Guessing (Tests), Scoring, Classification

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Direct link

Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item against a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?

Peer reviewed

Direct link

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025

To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…

Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory

The Impact of Scoring Later on Mixed Format Adaptive Testing

Direct link

Jing Ma – ProQuest LLC, 2024

This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, number and location of polytomous item. Results showed that while…

Descriptors: Scoring, Adaptive Testing, Test Items, Classification

Online Calibration in Multidimensional Computerized Adaptive Testing with Polytomously Scored Items

Peer reviewed

Direct link

Yuan, Lu; Huang, Yingshi; Li, Shuhang; Chen, Ping – Journal of Educational Measurement, 2023

Online calibration is a key technology for item calibration in computerized adaptive testing (CAT) and has been widely used in various forms of CAT, including unidimensional CAT, multidimensional CAT (MCAT), CAT with polytomously scored items, and cognitive diagnostic CAT. However, as multidimensional and polytomous assessment data become more…

Descriptors: Computer Assisted Testing, Adaptive Testing, Computation, Test Items

Automated Marking of Longer Computational Questions in Engineering Subjects

Peer reviewed

Direct link

Pearson, Christopher; Penna, Nigel – Assessment & Evaluation in Higher Education, 2023

E-assessments are becoming increasingly common and progressively more complex. Consequently, how these longer, more complex questions are designed and marked is imperative. This article uses the NUMBAS e-assessment tool to investigate the best practice for creating longer questions and their mark schemes on surveying modules taken by engineering…

Descriptors: Automation, Scoring, Engineering Education, Foreign Countries

Investigation of Rater Tendencies and Reliability in Different Assessment Methods with Many Facet Rasch Model

Peer reviewed
PDF on ERIC

Download full text

Koçak, Duygu – International Electronic Journal of Elementary Education, 2020

One of the most commonly used methods for measuring higher-order thinking skills such as problem-solving or written expression is open-ended items. Three main approaches are used to evaluate responses to open-ended items: general evaluation, rating scales, and rubrics. In order to measure and improve problem-solving skills of students, firstly, an…

Descriptors: Interrater Reliability, Item Response Theory, Test Items, Rating Scales

Item Response Theory and Modeling with Stata

Peer reviewed

Direct link

Raykov, Tenko – Measurement: Interdisciplinary Research and Perspectives, 2023

This software review discusses the capabilities of Stata to conduct item response theory modeling. The commands needed for fitting the popular one-, two-, and three-parameter logistic models are initially discussed. The procedure for testing the discrimination parameter equality in the one-parameter model is then outlined. The commands for fitting…

Descriptors: Item Response Theory, Models, Comparative Analysis, Item Analysis

Setting and Validating Multiple Standards on a Multistage-Adaptive Test

Peer reviewed

Direct link

Lewis, Jennifer; Lim, Hwanggyu; Padellaro, Frank; Sireci, Stephen G.; Zenisky, April L. – Educational Measurement: Issues and Practice, 2022

Setting cut scores on (MSTs) is difficult, particularly when the test spans several grade levels, and the selection of items from MST panels must reflect the operational test specifications. In this study, we describe, illustrate, and evaluate three methods for mapping panelists' Angoff ratings into cut scores on the scale underlying an MST. The…

Descriptors: Cutting Scores, Adaptive Testing, Test Items, Item Analysis

Empirical Evidence of Students' Systems Thinking Skills in ESD-Oriented: A Rasch Analysis Approach

Peer reviewed
PDF on ERIC

Download full text

Ikmanisa Khairati; L. Lufri; Muhyiatul Fadilah – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2025

Education for Sustainable Development (ESD) serves as a key accelerator for achieving the Sustainable Development Goals (SDGs), emphasizing systems thinking as an essential competency that must be cultivated in the learning process. This study investigates students' systems thinking skills within the ESD framework through assessments on…

Descriptors: Systems Approach, Thinking Skills, Sustainable Development, Biology

2023-2024 NSCAS Growth: English Language Arts, Mathematics, and Science Technical Report

Download full text

Nebraska Department of Education, 2024

The Nebraska Student-Centered Assessment System (NSCAS) is a statewide assessment system that embodies Nebraska's holistic view of students and helps them prepare for success in postsecondary education, career, and civic life. It uses multiple measures throughout the year to provide educators and decision-makers at all levels with the insights…

Descriptors: Student Evaluation, Evaluation Methods, Elementary School Students, Middle School Students

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 54

Educational and Psychological…	39
Journal of Educational…	32
Applied Measurement in…	28
ETS Research Report Series	21
ProQuest LLC	21
Applied Psychological…	15
Grantee Submission	15
Educational Measurement:…	14
International Journal of…	13
Educational Assessment	12
Journal of Psychoeducational…	12
Online Submission	12
Journal of Educational and…	11
Practical Assessment,…	9
Psychometrika	9
Language Testing	8
Arizona Department of…	7
New Meridian Corporation	6
Assessment & Evaluation in…	5
College Board	5
Educational Testing Service	5
Evaluation and the Health…	5
International Journal of…	5
Journal of Chemical Education	5
Journal of Experimental…	5
More ▼

Plake, Barbara S.	15
Bennett, Randy Elliot	12
Schoen, Robert C.	8
Dimitrov, Dimiter M.	6
Hambleton, Ronald K.	6
Liu, Ou Lydia	6
Livingston, Samuel A.	6
Dorans, Neil J.	5
Impara, James C.	5
Linn, Marcia C.	5
Lord, Frederic M.	5
Wainer, Howard	5
Yang, Xiaotong	5
Anderson, Paul S.	4
Buckendahl, Chad W.	4
Carlson, James E.	4
Ferdous, Abdullah A.	4
Haladyna, Thomas M.	4
Holme, Thomas A.	4
Huynh, Huynh	4
Kim, Sooyeon	4
Paek, Insu	4
Sireci, Stephen G.	4
Stansfield, Charles W.	4
More ▼

Reports - Research	401
Journal Articles	398
Reports - Evaluative	186
Speeches/Meeting Papers	134
Reports - Descriptive	83
Tests/Questionnaires	55
Guides - Non-Classroom	45
Guides - Classroom - Teacher	24
Dissertations/Theses -…	21
Numerical/Quantitative Data	21
Information Analyses	18
Collected Works - General	11
Guides - General	10
Opinion Papers	10
Books	9
Guides - Classroom - Learner	9
ERIC Digests in Full Text	4
ERIC Publications	4
Non-Print Media	3
Collected Works - Proceedings	2
Multilingual/Bilingual…	2
Reference Materials - General	2
Reports - General	2
Book/Product Reviews	1
Historical Materials	1
More ▼

National Assessment of…	30
Graduate Record Examinations	14
SAT (College Admission Test)	13
Program for International…	12
Advanced Placement…	11
Test of English as a Foreign…	7
Trends in International…	6
Washington Assessment of…	6
ACT Assessment	5
TerraNova Multiple Assessments	5
Comprehensive Tests of Basic…	3
Graduate Management Admission…	3
Preliminary Scholastic…	3
Autism Diagnostic Observation…	2
Metropolitan Achievement Tests	2
National Teacher Examinations	2
Praxis Series	2
Raven Progressive Matrices	2
United States Medical…	2
ACT Interest Inventory	1
Alabama High School…	1
Alberta Grade Twelve Diploma…	1
COMPASS (Computer Assisted…	1
Career Maturity Inventory	1
Center for Epidemiologic…	1
More ▼