Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 9 |
Descriptor
Scoring | 32 |
True Scores | 32 |
Error of Measurement | 9 |
Measurement Techniques | 9 |
Item Response Theory | 8 |
Statistical Analysis | 8 |
Test Reliability | 8 |
Test Validity | 6 |
Computer Assisted Testing | 5 |
Test Items | 5 |
Testing | 5 |
Author
Wilcox, Rand R. | 3 |
Attali, Yigal | 1 |
Ben-Simon, Anat | 1 |
Blok, H. | 1 |
Braun, Henry I. | 1 |
Brennan, Robert | 1 |
Brennan, Robert L. | 1 |
Brown, Michelle Stallone | 1 |
Cohen, Yoav | 1 |
Cook, Linda L. | 1 |
Cragnolino, Ana | 1 |
Publication Type
Journal Articles | 18 |
Reports - Research | 18 |
Reports - Evaluative | 8 |
Speeches/Meeting Papers | 4 |
Collected Works - General | 1 |
Information Analyses | 1 |
Reports - Descriptive | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 3 |
Postsecondary Education | 2 |
Elementary Secondary Education | 1 |
Secondary Education | 1 |
Audience
Researchers | 3 |
Assessments and Surveys
Test of English as a Foreign… | 3 |
Advanced Placement… | 1 |
Graduate Record Examinations | 1 |
Praxis Series | 1 |
SAT (College Admission Test) | 1 |
Wechsler Intelligence Scale… | 1 |
Kristen L. Murphy; David G. Schreurs; Melonie A. Teichert; Cynthia J. Luxford; Jaclyn M. Trate; Jordan T. Harshmann; Jamie L. Schneider – Chemistry Education Research and Practice, 2024
Providing students with feedback on their performance is a critical part of enhancing student learning in chemistry and is often integrated into homework assignments, quizzes, and exams. However, not all feedback is created equal, and the type of feedback the student receives can dramatically alter the utility of the feedback to reinforce correct…
Descriptors: Student Evaluation, Feedback (Response), Science Education, Introductory Courses
Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2020
This study presents new models for item response functions (IRFs) in the framework of the D-scoring method (DSM) that is gaining attention in the field of educational and psychological measurement and large-scale assessments. In a previous work on DSM, the IRFs of binary items were estimated using a logistic regression model (LRM). However, the LRM…
Descriptors: Item Response Theory, Scoring, True Scores, Scaling
Cohen, Yoav; Levi, Effi; Ben-Simon, Anat – Applied Measurement in Education, 2018
In the current study, two pools of 250 essays, all written as a response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation to each essay's true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By…
Descriptors: Test Validity, Automation, Scoring, Computer Assisted Testing
Yao, Lili; Haberman, Shelby J.; Zhang, Mo – ETS Research Report Series, 2019
Many assessments of writing proficiency that aid in making high-stakes decisions consist of several essay tasks evaluated by a combination of human holistic scores and computer-generated scores for essay features such as the rate of grammatical errors per word. Under typical conditions, a summary writing score is provided by a linear combination…
Descriptors: Prediction, True Scores, Computer Assisted Testing, Scoring
Erdodi, Laszlo A.; Richard, David C. S.; Hopwood, Christopher – Journal of Psychoeducational Assessment, 2009
Classical test theory assumes that ability level has no effect on measurement error. Newer test theories, however, argue that the precision of a measurement instrument changes as a function of the examinee's true score. Research has shown that administration errors are common in the Wechsler scales and that subtests requiring subjective scoring…
Descriptors: Scoring, Error of Measurement, True Scores, Intelligence Tests
Gierl, Mark J.; Cui, Ying; Zhou, Jiawen – Journal of Educational Measurement, 2009
The attribute hierarchy method (AHM) is a psychometric procedure for classifying examinees' test item responses into a set of structured attribute patterns associated with different components from a cognitive model of task performance. Results from an AHM analysis yield information on examinees' cognitive strengths and weaknesses. Hence, the AHM…
Descriptors: Test Items, True Scores, Psychometrics, Algebra
Edwards, Michael C.; Vevea, Jack L. – Journal of Educational and Behavioral Statistics, 2006
This article examines a subscore augmentation procedure. The approach uses empirical Bayes adjustments and is intended to improve the overall accuracy of measurement when information is scant. Simulations examined the impact of the method on subscale scores in a variety of realistic conditions. The authors focused on two popular scoring methods:…
Descriptors: Geometric Concepts, True Scores, Scoring, Item Response Theory

Wilcox, Rand R. – Applied Psychological Measurement, 1979
Using a new coefficient, a rescaling of the Bayes risk is examined and a modification of this coefficient is described which yields an index that always has a value between zero and one. (Author/MH)
Descriptors: Bayesian Statistics, Measurement Techniques, Scoring, Technical Reports

Wilcox, Rand R. – Journal of Educational Measurement, 1987
Four procedures are discussed for obtaining a confidence interval when answer-until-correct scoring is used in multiple choice tests. Simulated data show that the choice of procedure depends upon sample size. (GDC)
Descriptors: Computer Simulation, Multiple Choice Tests, Sample Size, Scoring
Rotou, Ourania; Elmore, Patricia B.; Headrick, Todd C. – 2001
This study investigated the number-correct scoring method based on different theories (classical true-score theory and multidimensional item response theory) when a standardized test requires more than one ability for an examinee to get a correct response. The number-correct scoring procedure that is widely used is the one that is defined in…
Descriptors: Item Response Theory, Scoring, Standardized Tests, Test Items
Kim, Dong-In; Brennan, Robert; Kolen, Michael – Journal of Educational Measurement, 2005
Four equating methods (3PL true score equating, 3PL observed score equating, beta 4 true score equating, and beta 4 observed score equating) were compared using four equating criteria: first-order equity (FOE), second-order equity (SOE), conditional-mean-squared-error (CMSE) difference, and the equi-percentile equating property. True score…
Descriptors: True Scores, Psychometrics, Equated Scores, Item Response Theory
Koplyay, Janos B.; And Others – 1972
The relationship between true ability (operationally defined as the number of items for which the examinee actually knew the correct answer) and the effects of guessing upon observed test variance was investigated. Three basic hypotheses were treated mathematically: there is no functional relationship between true ability and guessing success;…
Descriptors: Guessing (Tests), Predictor Variables, Probability, Scoring
Stanley, Julian C. – Educational and Psychological Measurement, 1970
It is shown that all obtained scores must meet the requirements for classical test-score theory with respect to definitions of true scores and errors of measurement if that frame of reference is to yield valid variance errors of measurement. (DG)
Descriptors: Measurement Techniques, Scores, Scoring, Statistical Analysis
Braun, Henry I.; Wainer, Howard – 1989
A desirable goal would be to develop a methodology for scoring essays so that the final grades are less affected by when or by whom each essay was read. It seems sensible to derive such grades by somehow adjusting the ratings originally given by each reader. This essay describes a solution that relies on statistical adjustment, using the context…
Descriptors: Essay Tests, Estimation (Mathematics), Interrater Reliability, Scoring
Klaas, Alan C. – 1975
Current usage and theory of the standard error of measurement call for one standard error of measurement figure to be used across all levels of scoring. The study revealed that scoring variance across scoring levels is not constant. As scoring ability increases, scoring variance decreases. The assertion that low and high scoring subjects will…
Descriptors: Error of Measurement, Guessing (Tests), Scoring, Statistical Analysis