Showing 751 to 765 of 1,116 results
Peer reviewed
Livingston, Samuel A. – Journal of Educational Measurement, 1993
The extent to which log-linear smoothing could improve the accuracy of common-item equating by the chained equipercentile method in small samples of examinees was investigated with responses from a 100-item test and 93,283 examinees. Smoothing reduced the sample size required for a given degree of accuracy. (SLD)
Descriptors: Advanced Placement Programs, Equated Scores, Estimation (Mathematics), High School Students
Peer reviewed
Brennan, Robert L. – Applied Measurement in Education, 1992
A conceptual framework and heuristic model for considering the existence, magnitude, and consequences of context effects are presented through an extension of some generalizability theory concepts. Context effects are often misunderstood, and current measurement models have serious limitations for examining them. Their importance needs to be…
Descriptors: Adaptive Testing, Context Effect, Equated Scores, Equations (Mathematics)
Peer reviewed
Harris, Deborah J.; Crouse, Jill D. – Applied Measurement in Education, 1993
Criteria used in the equating process proposed in the literature are reviewed. The discussion begins by examining how equating is defined. The controversy over the best criterion, the utility of some, and whether a criterion is needed at all means that much work needs to be done in this area. (SLD)
Descriptors: Data Collection, Definitions, Equated Scores, Evaluation Criteria
Kobrin, Jennifer L.; Melican, Gerald J. – College Board, 2007
This report synthesizes the research to date addressing the construct comparability of the SAT Reasoning Test and prior SAT I: Reasoning Test and the series of research studies addressing the equatability and subpopulation invariance of the SAT and SAT I.
Descriptors: College Entrance Examinations, Logical Thinking, Thinking Skills, Scores
Peer reviewed
Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – ETS Research Report Series, 2006
This study addresses the sample error and linking bias that occur with small and unrepresentative samples in a non-equivalent groups anchor test (NEAT) design. We propose a linking method called the "synthetic function," which is a weighted average of the identity function (the trivial equating function for forms that are known to be…
Descriptors: Equated Scores, Sample Size, Test Items, Statistical Bias
Peer reviewed
van der Linden, Wim J. – Applied Psychological Measurement, 2006
Traditionally, error in equating observed scores on two versions of a test is defined as the difference between the transformations that equate the quantiles of their distributions in the sample and population of test takers. But it is argued that if the goal of equating is to adjust the scores of test takers on one version of the test to make…
Descriptors: Equated Scores, Evaluation Criteria, Models, Error of Measurement
Lunz, Mary E.; Bergstrom, Betty A. – 1995
The Board of Registry (BOR) certifies medical technologists and other laboratory personnel. The BOR has studied adaptive testing for over 6 years and now administers all 17 BOR certification examinations using computerized adaptive testing (CAT). This paper presents an overview of the major research efforts from 1989 to the present related to test…
Descriptors: Adaptive Testing, Computer Assisted Testing, Decision Making, Equated Scores
Way, Walter D.; Reese, Clyde M. – 1991
The use of two alternative item response theory (IRT) estimation models in the scaling and equating of the Test of English as a Foreign Language (TOEFL) was explored; and item scaling and test equating results based on these models were compared with results based on the three-parameter (3PL) model currently being used with the TOEFL. Models were…
Descriptors: Correlation, Equated Scores, Estimation (Mathematics), Goodness of Fit
Morrison, Carol A.; Fitzpatrick, Steven J. – 1992
An attempt was made to determine which item response theory (IRT) equating method results in the least amount of equating error or "scale drift" when equating scores across one or more test forms. An internal anchor test design was employed with five different test forms, each consisting of 30 items, 10 in common with the base test and 5…
Descriptors: Comparative Analysis, Computer Simulation, Equated Scores, Error of Measurement
Bogan, Evelyn Doody; Yen, Wendy M. – 1983
Four multidimensional data configurations and one unidimensional data configuration were simulated for three differences in mean difficulty between two tests to be equated. Two chi-square statistics, Q1 and Q2, were examined for their ability to detect multidimensionality. Results indicated that Q1 did not discriminate between any of the…
Descriptors: Difficulty Level, Equated Scores, Goodness of Fit, Latent Trait Theory
Peer reviewed
Fleming, Margaret – Journal of Educational Measurement, 1975
The Anchor Test Study Manual was reviewed with the practitioner in mind. It represents an effort to equate and standardize eight commonly used elementary reading tests. Possibilities and limitations in using the manual are discussed. (BJG)
Descriptors: Achievement Tests, Book Reviews, Comparative Analysis, Elementary Education
Peer reviewed
Dorans, Neil J.; Zeller, Karin – ETS Research Report Series, 2004
In an article published in the spring 2003 issue of "Harvard Educational Review", Roy Freedle stated that the SAT® is both culturally and statistically biased. Freedle proposed a solution to this bias, which involved using a half-test made up of the most difficult items culled from a complete SAT examination. His claims, which garnered…
Descriptors: Scores, Scoring, Equated Scores, College Entrance Examinations
Hwang, Chi-en; Cleary, T. Anne – 1986
The results obtained from two basic types of pre-equatings of tests were compared: the item response theory (IRT) pre-equating and section pre-equating (SPE). The simulated data were generated from a modified three-parameter logistic model with a constant guessing parameter. Responses of two replication samples of 3000 examinees on two 72-item…
Descriptors: Computer Simulation, Equated Scores, Latent Trait Theory, Mathematical Models
Green, Donald Ross – 1986
Uses of the variety of scores generated by standardized achievement tests are discussed. Desirable characteristics of scales, raw score scales, percent of correct items, percentile ranks, grade equivalents, normal curve equivalents, and scale scores are considered. The various meanings and purposes of each type of score are discussed. It is…
Descriptors: Achievement Tests, Elementary Secondary Education, Equated Scores, Grade Equivalent Scores
Livingston, Samuel A. – 1984
Much previously published material for estimating the reliability of classification has been based on the assumption that a test consists of a known number of equally weighted items. The test score is the number of those items answered correctly. These methods cannot be used with classifications based on weighted composite scores, especially if…
Descriptors: Equated Scores, Essay Tests, Estimation (Mathematics), Mathematical Models