ERIC - Search Results

Showing all 6 results

Don't Just Judge the Spelling! The Influence of Spelling on Assessing Second-Language Student Essays

Peer reviewed

Jansen, Thorben; Vögelin, Cristina; Machts, Nils; Keller, Stefan; Möller, Jens – Frontline Learning Research, 2021

When judging subject-specific aspects of students' texts, teachers should assess various characteristics, e.g., spelling and content, independently of one another since these characteristics are indicators of different skills. Independent judgments enable teachers to adapt their classroom instruction according to students' skills. It is still…

Descriptors: Spelling, Punctuation, Writing Evaluation, Essays

A Strategy for Replacing Sum Scoring

Peer reviewed

Ramsay, James O.; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2017

This article promotes the use of modern test theory in testing situations where sum scores for binary responses are now used. It directly compares the efficiencies and biases of classical and modern test analyses and finds an improvement in the root mean squared error of ability estimates of about 5% for two designed multiple-choice tests and…

Descriptors: Scoring, Test Theory, Computation, Maximum Likelihood Statistics

Sticks, Stones, Words, and Broken Bones: New Field and Lab Evidence on Stereotype Threat

Peer reviewed

Wei, Thomas E. – Educational Evaluation and Policy Analysis, 2012

Stereotype threat is frequently purported to be an important determinant of gender gaps in math. Unlike prior studies, which mostly occur in lab settings, I use data from the National Assessment of Educational Progress (NAEP)--a large, representative assessment of U.S. children--where through a design quirk, students are randomly assigned test…

Descriptors: Stereotypes, Mathematics Achievement, Teacher Attitudes, Bias

Criterion Bias in Examinee-Centered Standard Setting: Some Thought Experiments.

Peer reviewed

Kane, Michael – Educational Measurement: Issues and Practice, 1998

Uses several thought experiments to explore the potential impact of the choice of criterion on the results of examinee-centered studies. Conclusions from these experiments are then used to examine the different cutting scores from several contrasting groups studies on the National Assessment of Educational Progress. (SLD)

Descriptors: Bias, Criteria, Cutting Scores, Selection

Dealing with Uncertainty about Item Parameters: Expected Response Functions.

Mislevy, Robert J.; And Others – 1994

It is a common practice in item response theory (IRT) to treat estimates of item parameters, say B̂, as if they were the known, true quantities, B. However, ignoring the uncertainty associated with item parameters can lead to biases and over-confidence in subsequent inferences such as ability estimation,…

Descriptors: Ability, Bias, Estimation (Mathematics), Item Response Theory

Proceedings of the 1970 Invitational Conference on Testing Problems.

Educational Testing Service, Princeton, NJ. – 1971

The conference theme was "The Promise and Perils of Educational Information Systems," defined as collections of test data on knowledges, skills, interests, and attitudes maintained for the purpose of educational decision making. Topics covered were: "Longer Education: Thinner, Broader, or Higher" (Fritz Machlup); "Testing:…

Descriptors: Bayesian Statistics, Bias, Blacks, Conferences
