Showing 1 to 15 of 35 results
Peer reviewed
Garcia-Perez, Miguel A. – Journal of Educational and Behavioral Statistics, 2010
A recent comparative analysis of alternative interval estimation approaches and procedures has shown that confidence intervals (CIs) for true raw scores determined with the Score method--which uses the normal approximation to the binomial distribution--have actual coverage probabilities that are closest to their nominal level. It has also recently…
Descriptors: Computation, Statistical Analysis, True Scores, Raw Scores
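A minimal sketch of the kind of interval this abstract refers to, assuming the "Score method" is the Wilson score interval for a binomial proportion, rescaled to the raw-score metric. The function name `score_interval` and its parameters are illustrative and do not come from the article.

```python
from math import sqrt
from statistics import NormalDist


def score_interval(x, n, level=0.95):
    """Wilson score interval for a true raw score (assumed form of the Score method).

    x: observed number-correct score, n: number of items,
    level: nominal confidence level.
    """
    z = NormalDist().inv_cdf(0.5 + level / 2)  # two-sided normal quantile
    p = x / n                                  # observed proportion correct
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    # Rescale from the proportion metric back to the raw-score metric.
    return n * (center - half), n * (center + half)


lower, upper = score_interval(30, 40)
```

Unlike the simpler interval centred on the observed score, this interval is pulled toward the middle of the score range and stays inside the range from 0 to n.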
Peer reviewed
Gao, Rui; He, Wei; Ruan, Chunyi – ETS Research Report Series, 2012
In this study, we investigated whether preequating results agree with equating results that are based on observed operational data (postequating) for a college placement program. Specifically, we examined the degree to which item response theory (IRT) true score preequating results agreed with those from IRT true score postequating and from…
Descriptors: College Entrance Examinations, Student Placement, Item Response Theory, True Scores
Livingston, Samuel A.; Lewis, Charles – Educational Testing Service, 2009
This report proposes an empirical Bayes approach to the problem of equating scores on test forms taken by very small numbers of test takers. The equated score is estimated separately at each score point, making it unnecessary to model either the score distribution or the equating transformation. Prior information comes from equatings of other…
Descriptors: Test Length, Equated Scores, Bayesian Statistics, Sample Size
Peer reviewed
Harasym, Peter H.; Woloschuk, Wayne; Cunning, Leslie – Advances in Health Sciences Education, 2008
Physician-patient communication is a clinical skill that can be learned and has a positive impact on patient satisfaction and health outcomes. A concerted effort at all medical schools is now directed at teaching and evaluating this core skill. Student communication skills are often assessed by an Objective Structured Clinical Examination (OSCE).…
Descriptors: Medical Schools, Family Practice (Medicine), Examiners, Error of Measurement
Peer reviewed
Lord, Frederic M.; Stocking, Martha L. – Psychometrika, 1976
A numerical procedure is outlined for obtaining an interval estimate of the regression of true score on observed score, utilizing only the frequency distribution of observed scores. The procedure assumes that the conditional distribution of observed scores for fixed true scores is binomial. Several illustrations are given. (Author/HG)
Descriptors: Correlation, Multiple Regression Analysis, Raw Scores, Statistical Analysis
Hoffman, R. Gene; Wise, Lauress L. – 2000
Classical test theory is based on the concept of a true score for each examinee, defined as the expected or average score across an infinite number of repeated parallel tests. In most cases, there is only a score from a single administration of the test in question. The difference between this single observed score and the underlying true score is…
Descriptors: Achievement, Classification, Observation, Probability
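The definition of a true score in this abstract can be illustrated with a small simulation. This is only a sketch: the normally distributed error, the value of the standard error of measurement, and the name `simulate_observed_scores` are assumptions made for illustration and are not taken from the report.

```python
import random
from statistics import mean


def simulate_observed_scores(true_score, sem, n_forms, seed=0):
    """Observed scores on n_forms hypothetical parallel forms: X = T + E."""
    rng = random.Random(seed)  # fixed seed so the run is reproducible
    return [true_score + rng.gauss(0.0, sem) for _ in range(n_forms)]


TRUE_SCORE = 30.0
scores = simulate_observed_scores(TRUE_SCORE, sem=3.0, n_forms=10_000)

# A single administration differs from the true score by its error term.
single_error = scores[0] - TRUE_SCORE

# The average over many parallel forms approaches the true score.
long_run_mean = mean(scores)
```

In practice only one element of `scores` is ever observed, which is why the size of `single_error` has to be characterised statistically rather than measured directly.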
Peer reviewed
Han, Tianqi; And Others – Applied Measurement in Education, 1997
Stability among equating procedures was studied by comparing item response theory (IRT) true-score equating with IRT observed-score equating, IRT true-score equating with equipercentile equating, and IRT observed-score equating with equipercentile equating. On average, IRT true-score equating more frequently produced more stable conversions. (SLD)
Descriptors: Comparative Analysis, Equated Scores, Item Response Theory, Raw Scores
Peer reviewed
Lee, Won-Chan; Brennan, Robert L.; Kolen, Michael J. – Journal of Educational and Behavioral Statistics, 2006
Assuming errors of measurement are distributed binomially, this article reviews various procedures for constructing an interval for an individual's true number-correct score; presents two general interval estimation procedures for an individual's true scale score (i.e., normal approximation and endpoints conversion methods); compares various…
Descriptors: Probability, Intervals, Guidelines, Computer Simulation
Peer reviewed
van der Linden, Wim J.; Mellenbergh, Gideon J. – Applied Psychological Measurement, 1978
A general coefficient for tests, delta, is derived from a decision theoretic point of view. The situations are considered in which a true score is estimated by a function of the observed score, observed scores are split into more than two categories, and observed scores are split into only two categories. (Author/CTM)
Descriptors: Criterion Referenced Tests, Decision Making, Mathematical Models, Raw Scores
Peer reviewed
Allison, Paul A. – Psychometrika, 1976
A direct proof is given for the generalized Spearman-Brown formula for any real multiple of test length. (Author)
Descriptors: Correlation, Error of Measurement, Raw Scores, Test Length
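For reference, the Spearman-Brown formula itself (not the proof given in the article) can be written as a short function. The name `spearman_brown` is illustrative; the length factor `k` may be any positive real number, which is the generalized case the abstract mentions.

```python
def spearman_brown(reliability, k):
    """Predicted reliability when test length is multiplied by k.

    reliability: reliability of the original test (between 0 and 1)
    k: positive real length factor (k=2 doubles the test, k=0.5 halves it)
    """
    if k <= 0:
        raise ValueError("length factor k must be positive")
    return k * reliability / (1 + (k - 1) * reliability)


doubled = spearman_brown(0.5, 2)    # lengthening raises reliability
halved = spearman_brown(0.8, 0.5)   # shortening lowers it
```

With `k = 1` the function returns the original reliability unchanged, which is a quick sanity check on the formula.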
Peer reviewed
Cureton, Edward E. – Educational and Psychological Measurement, 1971
A derivation of a formula for the stability coefficient is presented and discussed in terms of test reliability over time. (PR)
Descriptors: Error of Measurement, Raw Scores, Statistical Analysis, Test Reliability
Peer reviewed
Atkinson, Leslie – Journal of School Psychology, 1990
Offers standard errors of prediction and confidence intervals for the Vineland Adaptive Behavior Scales (VABS) that help in deciding whether variation in the obtained scores of a scale administered to the same person more than once is a result of measurement error or reflects actual change in the examinee's functional level. Presented values were…
Descriptors: Error of Measurement, Foreign Countries, Raw Scores, Test Interpretation
Eignor, Daniel R.; And Others – 1995
Two recent simulation studies were conducted to aid in the diagnosis and interpretation of equating differences found between random and matched (nonrandom) samples for four commonly used equating procedures: (1) Tucker; (2) Levine equally reliable; (3) Chained equipercentile observed-score; and (4) three-parameter, item response theory true-score…
Descriptors: Criteria, Equated Scores, Item Response Theory, Raw Scores
Cureton, Edward E. – 1973
Presented are the methodology and results of an equipercentile equating study in which subtests of the following three editions of multiple aptitude test batteries, in widespread use in 1960, were equated to the tests of the Project TALENT test battery: Flanagan Aptitude Classification Tests (1957); Differential Aptitude Tests (1947); and the…
Descriptors: Aptitude Tests, Equated Scores, Raw Scores, Secondary Education
Livingston, Samuel A. – 1970
The procedure of estimating true scores by means of a transformation of the obtained score based on the reliability coefficient is compared with the use of the obtained score without transformation. Using the mean squared error as a criterion, the transformed score is a better estimate for most examinees but poorer for those whose true scores lie…
Descriptors: Analysis of Variance, Measurement, Raw Scores, Scores
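A minimal sketch of a reliability-based transformation of the kind this abstract describes, assuming it is the standard regression (Kelley) estimate, which shrinks each obtained score toward the group mean. The function name `kelley_true_score` and the example values are hypothetical.

```python
def kelley_true_score(obtained, reliability, group_mean):
    """Regression estimate of a true score (assumed to be Kelley's formula).

    The obtained score is weighted by the reliability coefficient and the
    group mean by its complement, so less reliable tests shrink more.
    """
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie between 0 and 1")
    return reliability * obtained + (1.0 - reliability) * group_mean


# An obtained score of 80 on a test with reliability 0.8 and group mean 50
# is pulled part of the way back toward the mean.
estimate = kelley_true_score(80.0, 0.8, 50.0)
```

Because every estimate moves toward the mean, examinees whose true scores lie far from the mean are the ones for whom the shrinkage can hurt, which is consistent with the comparison the abstract sets up.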