ERIC - Search Results

Showing all 9 results

Resolving and Re-Scoring Constructed Response Items in Mixed-Format Assessments: An Exploration of Three Approaches

Peer reviewed

Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024

We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…

Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners

Modeling of Item Response Functions under the D-Scoring Method

Peer reviewed

Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2020

This study presents new models for item response functions (IRFs) in the framework of the D-scoring method (DSM), which is gaining attention in the field of educational and psychological measurement and large-scale assessments. In a previous work on DSM, the IRFs of binary items were estimated using a logistic regression model (LRM). However, the LRM…

Descriptors: Item Response Theory, Scoring, True Scores, Scaling

A Polytomous Scoring Approach to Handle Not-Reached Items in Low-Stakes Assessments

Peer reviewed

Gorgun, Guher; Bulut, Okan – Educational and Psychological Measurement, 2021

In low-stakes assessments, some students may not reach the end of the test and leave some items unanswered due to various reasons (e.g., lack of test-taking motivation, poor time management, and test speededness). Not-reached items are often treated as incorrect or not-administered in the scoring process. However, when the proportion of…

Descriptors: Scoring, Test Items, Response Style (Tests), Mathematics Tests

Online Administration of the Test of Narrative Language--Second Edition: Psychometrics and Considerations for Remote Assessment

Peer reviewed

Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Grantee Submission, 2022

Purpose: Our aim was to evaluate the psychometric properties of the online administered format of the Test of Narrative Language--Second Edition (TNL-2; Gillam & Pearson, 2017), given the importance of assessing children's narrative ability and considerable absence of psychometric studies of spoken language assessments administered online.…

Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments

Online Administration of the Test of Narrative Language--Second Edition: Psychometrics and Considerations for Remote Assessment

Peer reviewed

Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Language, Speech, and Hearing Services in Schools, 2022

Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments

Rank Ordering or Judge-Awarded Ratings?

Linacre, John M. – 1990

Rank ordering examinees is an easier task for judges than is awarding numerical ratings. A measurement model for rankings based on Rasch's objectivity axioms provides linear, sample-independent and judge-independent measures. Estimates of examinee measures are obtained from the data set of rankings, along with standard errors and fit statistics.…

Descriptors: Comparative Analysis, Error of Measurement, Essay Tests, Evaluators

New Mexico Standards-Based Assessment Technical Report: Spring 2007 Administration

New Mexico Public Education Department, 2007

The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview and the technical characteristics of the 2007 NMSBA. The 2007 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Summary of student performance; (4) Statistical analyses of item and…

Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring

Improving Prediction by Correcting Test Scores for Person Disturbances Using the Rasch Model.

Westfall, Philip Jean-Louis; D'Costa, Ayres G. – 1987

This study, based on the Rasch model, used R. M. Smith's (1986) classification of measurement disturbances to assess the Rasch model approach to error control and statistical prediction. Partitioning the error component into a person component, an item-person interaction component, and a random unexplained error component has the net effect of…

Descriptors: Classification, College Entrance Examinations, Error of Measurement, French

New Mexico Standards Based Assessment (NMSBA) Technical Report: 2006 Spring Administration

Griph, Gerald W. – New Mexico Public Education Department, 2006

The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview and the technical characteristics of the 2006 NMSBA. The 2006 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Calibration, scaling, and equating procedures; (4) Standard setting;…

Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring