ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	5

Descriptor

Comparative Analysis	11
Error of Measurement	11
Raw Scores	11
Equated Scores	4
Mathematical Models	4
Statistical Analysis	4
Test Reliability	4
Psychometrics	3
Test Format	3
College Entrance Examinations	2
Elementary Education	2
Goodness of Fit	2
Item Analysis	2
Item Response Theory	2
Measurement Techniques	2
Multiple Choice Tests	2
Probability	2
Reading Comprehension	2
Reading Tests	2
Reliability	2
Scores	2
Standardized Tests	2
Vocabulary	2
Accuracy	1
Achievement Gains	1

Source

ETS Research Report Series	2
Educational Measurement:…	1
Journal of Educational…	1
Thought & Action	1

Author

Bashaw, W. L.	2
Lee, Won-Chan	2
Liu, Jinghua	2
Rentz, R. Robert	2
Ackerman, Terry A.	1
Choi, Jiwon	1
Curley, Edward	1
Dorans, Neil	1
Evans, John A.	1
Francis, Richard W.	1
Guo, Hongwen	1
Kang, Yujin	1
Kim, Stella Y.	1
Kolen, Michael J.	1
Low, Albert C.	1
Marston, Paul T., Borich,…	1
Richard, James M., Jr.	1
Schumacker, Randall E.	1

Publication Type

Journal Articles	5
Reports - Research	4
Reports - Evaluative	3
Reports - Descriptive	2
Speeches/Meeting Papers	2
Information Analyses	1
Numerical/Quantitative Data	1

Education Level

Higher Education	3
Postsecondary Education	3
High Schools	1
Secondary Education	1

Assessments and Surveys

SAT (College Admission Test)	2
Praxis Series	1

Showing all 11 results

IRT Approaches to Modeling Scores on Mixed-Format Tests

Peer reviewed

Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020

This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…

Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests

Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

Peer reviewed

Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011

This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores

The Stability of the Score Scales for the "SAT Reasoning Test"™ from 2005 to 2010. Research Report. ETS RR-12-15

Peer reviewed

Guo, Hongwen; Liu, Jinghua; Curley, Edward; Dorans, Neil – ETS Research Report Series, 2012

This study examines the stability of the "SAT Reasoning Test"™ score scales from 2005 to 2010. A 2005 old form (OF) was administered along with a 2010 new form (NF). A new conversion for OF was derived through direct equipercentile equating. A comparison of the newly derived and the original OF conversions showed that Critical Reading…

Descriptors: Aptitude Tests, Cognitive Tests, Thinking Skills, Equated Scores

An Exploration of Kernel Equating Using SAT® Data: Equating to a Similar Population and to a Distant Population. Research Report. ETS RR-07-17

Peer reviewed

Liu, Jinghua; Low, Albert C. – ETS Research Report Series, 2007

This study applied kernel equating (KE) in two scenarios: equating to a very similar population and equating to a very different population, referred to as a distant population, using SAT® data. The KE results were compared to the results obtained from analogous classical equating methods in both scenarios. The results indicate that KE results are…

Descriptors: College Entrance Examinations, Equated Scores, Comparative Analysis, Evaluation Methods

Comparing Measurement Theories.

Schumacker, Randall E. – 1998

In comparing measurement theories, it is evident that the awareness of the concept of measurement error during the time of Galileo has led to the formulation of observed scores comprising a true score and error (classical theory), universe score and various random error components (generalizability theory), or individual latent ability and error…

Descriptors: Comparative Analysis, Computer Software, Error of Measurement, Generalizability Theory

Common Errors in Calculating Final Grades

Peer reviewed

Francis, Richard W. – Thought & Action, 2006

The author has discovered that errors in grades often occur when scores are combined for final marks. These errors are not related to the grading of individual assignments. Rather, they occur when teachers at all grade levels bring individual test and assignment scores together for the students' final grades. Unfortunately, professors of mathematics…

Descriptors: Error Patterns, Scores, Grades (Scholastic), Error Correction

Regression Weights and Communication among Researchers from Different Disciplines.

Richard, James M., Jr. – 1979

This report examines the difference in approaches between sociologists and psychologists when using multiple regression techniques in the analysis of behavioral data. Psychologists and sociologists are often divided in their orientation toward regression techniques, and this division could be a substantial and unfortunate barrier to communication…

Descriptors: Comparative Analysis, Data Analysis, Differences, Error of Measurement

An Investigation of the Relationship between Reliability, Power, and the Type I Error Rate of the Mantel-Haenszel and Simultaneous Item Bias Detection Procedures.

Ackerman, Terry A.; Evans, John A. – 1992

The relationship between levels of reliability and the power of two bias and differential item functioning (DIF) detection methods is examined. Both methods, the Mantel-Haenszel (MH) procedure of P. W. Holland and D. T. Thayer (1988) and the Simultaneous Item Bias (SIB) procedure of R. Shealy and W. Stout (1991), use examinees' raw scores as a…

Descriptors: Comparative Analysis, Equations (Mathematics), Error of Measurement, Item Bias

Analysis of Covariance: Is It the Appropriate Model to Study Change?

Marston, Paul T.; Borich, Gary D. – 1977

The four main approaches to measuring treatment effects in schools (raw gain, residual gain, covariance, and true scores) were compared. A simulation study showed that true score analysis produced a large number of Type I errors. When corrected for this error, this method showed the least power of the four. This outcome was clearly the result of the…

Descriptors: Achievement Gains, Analysis of Covariance, Comparative Analysis, Error of Measurement

Equating Reading Tests With the Rasch Model. Volume I, Final Report.

Rentz, R. Robert; Bashaw, W. L. – 1975

In order to determine if Rasch Model procedures have any utility for equating pre-existing tests, this study reanalyzed the data from the equating phase of the Anchor Test Study which used a variety of equipercentile and linear model methods. The tests involved included seven reading test batteries, each having from one to three levels and two…

Descriptors: Comparative Analysis, Elementary Education, Equated Scores, Error of Measurement

Equating Reading Tests With the Rasch Model. Volume II, Technical Reference Tables.

Rentz, R. Robert; Bashaw, W. L. – 1975

This volume contains tables of item analysis results obtained by following procedures associated with the Rasch Model for those reading tests used in the Anchor Test Study. Appendix I gives the test names and their corresponding analysis code numbers. Section I (Basic Item Analyses) presents data for the item analysis of each test in a two part…

Descriptors: Comparative Analysis, Elementary Education, Equated Scores, Error of Measurement