ERIC - Search Results

Showing 1 to 15 of 35 results

Confidence Intervals for True Scores Using the Skew-Normal Distribution

Peer reviewed

Garcia-Perez, Miguel A. – Journal of Educational and Behavioral Statistics, 2010

A recent comparative analysis of alternative interval estimation approaches and procedures has shown that confidence intervals (CIs) for true raw scores determined with the Score method--which uses the normal approximation to the binomial distribution--have actual coverage probabilities that are closest to their nominal level. It has also recently…

Descriptors: Computation, Statistical Analysis, True Scores, Raw Scores
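Illustrative note (not part of the ERIC record): the sketch below assumes the "Score method" named in the abstract is the Wilson score interval for a binomial proportion, rescaled to the raw-score metric. The function name `score_interval` and its parameters are hypothetical, chosen only for this example.

```python
import math


def score_interval(x, n, z=1.96):
    """Wilson score interval for a true raw score.

    x: observed number-correct score, n: number of items,
    z: standard normal quantile (1.96 for a nominal 95% interval).
    Assumes binomially distributed errors of measurement.
    """
    p = x / n                                  # observed proportion correct
    denom = 1 + z ** 2 / n
    center = (p + z ** 2 / (2 * n)) / denom    # shrunken midpoint
    half = z * math.sqrt(p * (1 - p) / n + z ** 2 / (4 * n ** 2)) / denom
    # Rescale the proportion limits to the raw-score metric.
    return n * (center - half), n * (center + half)


if __name__ == "__main__":
    low, high = score_interval(20, 40)
    print(f"95% interval for true raw score: {low:.2f} to {high:.2f}")
```

Unlike the simple Wald interval, the limits stay inside the range 0 to n even for extreme observed scores.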

Does Preequating Work? An Investigation into a Preequated Testlet-Based College Placement Exam Using Postadministration Data. Research Report. ETS RR-12-12

Peer reviewed

Gao, Rui; He, Wei; Ruan, Chunyi – ETS Research Report Series, 2012

In this study, we investigated whether preequating results agree with equating results that are based on observed operational data (postequating) for a college placement program. Specifically, we examined the degree to which item response theory (IRT) true score preequating results agreed with those from IRT true score postequating and from…

Descriptors: College Entrance Examinations, Student Placement, Item Response Theory, True Scores

Small-Sample Equating with Prior Information. Research Report. ETS RR-09-25

Livingston, Samuel A.; Lewis, Charles – Educational Testing Service, 2009

This report proposes an empirical Bayes approach to the problem of equating scores on test forms taken by very small numbers of test takers. The equated score is estimated separately at each score point, making it unnecessary to model either the score distribution or the equating transformation. Prior information comes from equatings of other…

Descriptors: Test Length, Equated Scores, Bayesian Statistics, Sample Size

Undesired Variance Due to Examiner Stringency/Leniency Effect in Communication Skill Scores Assessed in OSCEs

Peer reviewed

Harasym, Peter H.; Woloschuk, Wayne; Cunning, Leslie – Advances in Health Sciences Education, 2008

Physician-patient communication is a clinical skill that can be learned and has a positive impact on patient satisfaction and health outcomes. A concerted effort at all medical schools is now directed at teaching and evaluating this core skill. Student communication skills are often assessed by an Objective Structured Clinical Examination (OSCE).…

Descriptors: Medical Schools, Family Practice (Medicine), Examiners, Error of Measurement

An Interval Estimate for Making Statistical Inferences About True Scores

Peer reviewed

Lord, Frederic M.; Stocking, Martha L. – Psychometrika, 1976

A numerical procedure is outlined for obtaining an interval estimate of the regression of true score on observed score, utilizing only the frequency distribution of observed scores. The procedure assumes that the conditional distribution of observed scores for fixed true scores is binomial. Several illustrations are given. (Author/HG)

Descriptors: Correlation, Multiple Regression Analysis, Raw Scores, Statistical Analysis

Establishing the Reliability of Student Proficiency Classifications: The Accuracy of Observed Classifications.

Hoffman, R. Gene; Wise, Lauress L. – 2000

Classical test theory is based on the concept of a true score for each examinee, defined as the expected or average score across an infinite number of repeated parallel tests. In most cases, there is only a score from a single administration of the test in question. The difference between this single observed score and the underlying true score is…

Descriptors: Achievement, Classification, Observation, Probability

A Comparison among IRT True- and Observed-Score Equatings and Traditional Equipercentile Equating.

Peer reviewed

Han, Tianqi; And Others – Applied Measurement in Education, 1997

Stability among equating procedures was studied by comparing item response theory (IRT) true-score equating with IRT observed-score equating, IRT true-score equating with equipercentile equating, and IRT observed-score equating with equipercentile equating. On average, IRT true-score equating more frequently produced more stable conversions. (SLD)

Descriptors: Comparative Analysis, Equated Scores, Item Response Theory, Raw Scores

Interval Estimation for True Raw and Scale Scores under the Binomial Error Model

Peer reviewed

Lee, Won-Chan; Brennan, Robert L.; Kolen, Michael J. – Journal of Educational and Behavioral Statistics, 2006

Assuming errors of measurement are distributed binomially, this article reviews various procedures for constructing an interval for an individual's true number-correct score; presents two general interval estimation procedures for an individual's true scale score (i.e., normal approximation and endpoints conversion methods); compares various…

Descriptors: Probability, Intervals, Guidelines, Computer Simulation

Coefficients for Tests from a Decision Theoretic Point of View

Peer reviewed

Vander Linden, Wim J.; Mellenbergh, Gideon J. – Applied Psychological Measurement, 1978

A general coefficient for tests, delta, is derived from a decision theoretic point of view. The situations are considered in which a true score is estimated by a function of the observed score, observed scores are split into more than two categories, and observed scores are split into only two categories. (Author/CTM)

Descriptors: Criterion Referenced Tests, Decision Making, Mathematical Models, Raw Scores

A Simple Proof of the Spearman-Brown Formula for Continuous Test Lengths

Peer reviewed

Allison, Paul A. – Psychometrika, 1976

A direct proof is given for the generalized Spearman-Brown formula for any real multiple of test length. (Author)

Descriptors: Correlation, Error of Measurement, Raw Scores, Test Length
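Illustrative note (not part of the ERIC record): the generalized Spearman-Brown formula referred to in the abstract is usually written as rho_k = k * rho / (1 + (k - 1) * rho), where rho is the reliability of the original test and k is any positive real multiple of its length. A minimal sketch, with the hypothetical function name `spearman_brown`:

```python
def spearman_brown(rho, k):
    """Predicted reliability of a test lengthened by a factor k.

    rho: reliability of the original test (between 0 and 1).
    k: positive real multiple of the original length
       (k < 1 shortens the test, k > 1 lengthens it).
    """
    if k <= 0:
        raise ValueError("k must be positive")
    return k * rho / (1 + (k - 1) * rho)


if __name__ == "__main__":
    # Doubling a test with reliability 0.5, then halving one with 0.8.
    print(spearman_brown(0.5, 2))
    print(spearman_brown(0.8, 0.5))
```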

The Stability Coefficient

Peer reviewed

Cureton, Edward E. – Educational and Psychological Measurement, 1971

A derivation of a formula for the stability coefficient is presented and discussed in terms of test reliability over time. (PR)

Descriptors: Error of Measurement, Raw Scores, Statistical Analysis, Test Reliability

Standard Errors of Prediction for the Vineland Adaptive Behavior Scales.

Peer reviewed

Atkinson, Leslie – Journal of School Psychology, 1990

Offers standard errors of prediction and confidence intervals for Vineland Adaptive Behavior Scales (VABS) that help in deciding whether variation in obtained scores of scale administered to the same person more than once is a result of measurement error or whether it reflects actual change in examinee's functional level. Presented values were…

Descriptors: Error of Measurement, Foreign Countries, Raw Scores, Test Interpretation

The Effects on Observed- and True-Score Equating Procedures of Matching on a Fallible Criterion: A Simulation with Test Variation.

Eignor, Daniel R.; And Others – 1995

Two recent simulation studies were conducted to aid in the diagnosis and interpretation of equating differences found between random and matched (nonrandom) samples for four commonly used equating procedures: (1) Tucker; (2) Levine equally reliable; (3) Chained equipercentile observed-score; and (4) three-parameter, item response theory true-score…

Descriptors: Criteria, Equated Scores, Item Response Theory, Raw Scores

Project TALENT Tests as Anchors for Equating Other Tests.

Cureton, Edward E. – 1973

Presented are the methodology and results of an equipercentile equating study in which subtests of the following three editions of multiple aptitude test batteries, in widespread use in 1960, were equated to the tests of the Project TALENT test battery: Flanagan Aptitude Classification Tests (1957); Differential Aptitude Tests (1947); and the…

Descriptors: Aptitude Tests, Equated Scores, Raw Scores, Secondary Education

Some Observations on the Estimation of True Scores.

Livingston, Samuel A. – 1970

The procedure of estimating true scores by means of a transformation of the obtained score based on the reliability coefficient is compared with the use of the obtained score without transformation. Using the mean squared error as a criterion, the transformed score is a better estimate for most examinees but poorer for those whose true scores lie…

Descriptors: Analysis of Variance, Measurement, Raw Scores, Scores
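Illustrative note (not part of the ERIC record): the abstract does not name the transformation it evaluates. The sketch below assumes it is the classical regressed (Kelley) estimate, which pulls the obtained score toward the group mean in proportion to the unreliability of the test. The function name `regressed_true_score` and the sample values are hypothetical.

```python
def regressed_true_score(obtained, group_mean, reliability):
    """Kelley-style regressed estimate of a true score.

    obtained: the examinee's obtained score.
    group_mean: mean obtained score in the reference group.
    reliability: reliability coefficient of the test (between 0 and 1).
    """
    return group_mean + reliability * (obtained - group_mean)


if __name__ == "__main__":
    # With reliability 0.8, a score 10 points above the mean is
    # estimated to reflect a true score 8 points above the mean.
    print(regressed_true_score(60, 50, 0.8))
```

Under this assumption, the estimate moves every score toward the mean, which is consistent with the abstract's point that it serves most examinees well but not those whose true scores lie far from the mean.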
