NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 6 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Jansen, Thorben; Vögelin, Cristina; Machts, Nils; Keller, Stefan; Möller, Jens – Frontline Learning Research, 2021
When judging subject-specific aspects of students' texts, teachers should assess various characteristics, e.g., spelling and content, independently of one another since these characteristics are indicators of different skills. Independent judgments enable teachers to adapt their classroom instruction according to students' skills. It is still…
Descriptors: Spelling, Punctuation, Writing Evaluation, Essays
Peer reviewed Peer reviewed
Direct linkDirect link
Ramsay, James O.; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2017
This article promotes the use of modern test theory in testing situations where sum scores for binary responses are now used. It directly compares the efficiencies and biases of classical and modern test analyses and finds an improvement in the root mean squared error of ability estimates of about 5% for two designed multiple-choice tests and…
Descriptors: Scoring, Test Theory, Computation, Maximum Likelihood Statistics
Peer reviewed Peer reviewed
Direct linkDirect link
Wei, Thomas E. – Educational Evaluation and Policy Analysis, 2012
Stereotype threat is frequently purported to be an important determinant of gender gaps in math. Unlike prior studies, which mostly occur in lab settings, I use data from the National Assessment of Educational Progress (NAEP)--a large, representative assessment of U.S. children--where through a design quirk, students are randomly assigned test…
Descriptors: Stereotypes, Mathematics Achievement, Teacher Attitudes, Bias
Peer reviewed Peer reviewed
Kane, Michael – Educational Measurement: Issues and Practice, 1998
Uses several thought experiments to explore the potential impact of the choice of criterion on the results of examinee-centered studies. Conclusions from these experiments are then used to examine the different cutting scores from several contrasting groups studies on the National Assessment of Educational Progress. (SLD)
Descriptors: Bias, Criteria, Cutting Scores, Selection
Mislevy, Robert J.; And Others – 1994
It is a common practice in item response theory (IRT) to treat estimates of item parameters, say "B" circumflex, as if they were the known, true quantities, "B." However, ignoring the uncertainty associated with item parameters can lead to biases and over-confidence in subsequent inferences such as ability estimation,…
Descriptors: Ability, Bias, Estimation (Mathematics), Item Response Theory
Educational Testing Service, Princeton, NJ. – 1971
The conference theme was "The Promise and Perils of Educational Information Systems," defined as collections of test data on knowledges, skills, interests, and attitudes maintained for the purpose of educational decision making. Topics covered were: "Longer Education: Thinner, Broader, or Higher" (Fritz Machlup); "Testing:…
Descriptors: Bayesian Statistics, Bias, Blacks, Conferences