NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 751 to 765 of 1,113 results Save | Export
Peer reviewed Peer reviewed
Weitzman, R. A. – Educational and Psychological Measurement, 1996
Application of the Rasch model to data from 267 naval personnel taking a multiple-choice test subject to guessing indicates that goodness of fit is a problem small enough to ignore, but that modification to account for guessing may be necessary for accurate equating. (SLD)
Descriptors: Equated Scores, Goodness of Fit, Guessing (Tests), Item Response Theory
Peer reviewed Peer reviewed
Davenport, Ernest C., Jr. – Educational and Psychological Measurement, 1990
The properties of Tucker's congruence coefficient--an index used to help determine the equality of latent structures for different subgroups on a common set of test items--are explored. An example using data from 1,370 subjects in the National Assessment of Educational Progress 1976 mathematics assessment illustrates the index's use. (SLD)
Descriptors: Equated Scores, Grade 8, Junior High School Students, Junior High Schools
Peer reviewed Peer reviewed
MacCann, Robert G. – Journal of Educational Statistics, 1990
For anchor test equating, 3 linear observed score methods are derived for populations differing in ability. Each version requires that the correlations of the tests with the selection variable be known. Five sets of assumptions are made for each model--yielding 15 methods--which are then related to existing methods. (SLD)
Descriptors: Ability, Ability Grouping, Equated Scores, Equations (Mathematics)
Peer reviewed Peer reviewed
Livingston, Samuel A.; And Others – Applied Measurement in Education, 1990
Combinations of five methods of equating test scores and two methods of selecting samples of students for equating were compared for accuracy, using data from the administration of the Scholastic Aptitude Test to more than 115,000 students. Implications for research and practice are discussed. (SLD)
Descriptors: College Entrance Examinations, Equated Scores, Evaluation Methods, High School Students
Peer reviewed Peer reviewed
Wilson, Mark; Wang, Wen-chung – Applied Psychological Measurement, 1995
Data from the California Learning Assessment System mathematics assessment were used to examine issues that arise when scores from different assessment modes are combined. Multiple-choice, open-ended, and investigation items were combined in a test across three test forms. Results illustrate the difficulties faced in evaluating combined…
Descriptors: Educational Assessment, Equated Scores, Evaluation Methods, Item Response Theory
Peer reviewed Peer reviewed
Livingston, Samuel A. – Journal of Educational Measurement, 1993
The extent to which log-linear smoothing could improve the accuracy of common-item equating by the chained equipercentile method in small samples of examinees was investigated with responses from a 100-item test and 93,283 examinees. Smoothing reduced the sample size required for a given degree of accuracy. (SLD)
Descriptors: Advanced Placement Programs, Equated Scores, Estimation (Mathematics), High School Students
Peer reviewed Peer reviewed
Brennan, Robert L. – Applied Measurement in Education, 1992
A conceptual framework and heuristic model for considering the existence, magnitude, and consequences of context effects are presented through an extension of some generalizability theory concepts. Context effects are often misunderstood, and current measurement models have serious limitations for examining them. Their importance needs to be…
Descriptors: Adaptive Testing, Context Effect, Equated Scores, Equations (Mathematics)
Peer reviewed Peer reviewed
Harris, Deborah J.; Crouse, Jill D. – Applied Measurement in Education, 1993
Criteria used in the equating process proposed in the literature are reviewed. The discussion begins by examining how equating is defined. The controversy over the best criterion, the utility of some, and whether a criterion is needed at all means that much work needs to be done in this area. (SLD)
Descriptors: Data Collection, Definitions, Equated Scores, Evaluation Criteria
Peer reviewed Peer reviewed
PDF on ERIC Download full text
von Davier, Alina A.; Carstensen, Claus H.; von Davier, Matthias – ETS Research Report Series, 2006
Measuring and linking competencies require special instruments, special data collection designs, and special statistical models. The measurement instruments are tests or tests forms, which can be used in the following situations: The same test can be given repeatedly; two or more parallel tests forms (i.e., forms intended to be similar in…
Descriptors: Scores, Measurement Techniques, Competence, Comparative Analysis
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Holland, Paul W.; von Davier, Alina A.; Sinharay, Sandip; Han, Ning – ETS Research Report Series, 2006
This paper focuses on the Non-Equivalent Groups with Anchor Test (NEAT) design for test equating and on two classes of observed--score equating (OSE) methods--chain equating (CE) and poststratification equating (PSE). These two classes of methods reflect two distinctly different ways of using the information provided by the anchor test for…
Descriptors: Equated Scores, Test Items, Statistical Analysis, Comparative Analysis
Kirkup, Catherine; Schagen, Ian; Wheater, Rebecca; Morrison, Jo; Whetton, Chris – National Foundation for Educational Research, 2007
In September 2005 the National Foundation for Educational Research (NFER) in association with the Department for Education and Skills (DfES), the Sutton Trust and the College Board, began a five-year research study to examine the validity of an aptitude test in higher education admissions. This report describes and explores the relationships…
Descriptors: Educational Research, Academic Achievement, Student Surveys, Aptitude Tests
Lunz, Mary E.; Bergstrom, Betty A. – 1995
The Board of Registry (BOR) certifies medical technologists and other laboratory personnel. The BOR has studied adaptive testing for over 6 years and now administers all 17 BOR certification examinations using computerized adaptive testing (CAT). This paper presents an overview of the major research efforts from 1989 to the present related to test…
Descriptors: Adaptive Testing, Computer Assisted Testing, Decision Making, Equated Scores
Way, Walter D.; Reese, Clyde M. – 1991
The use of two alternative item response theory (IRT) estimation models in the scaling and equating of the Test of English as a Foreign Language (TOEFL) was explored; and item scaling and test equating results based on these models were compared with results based on the three-parameter (3PL) model currently being used with the TOEFL. Models were…
Descriptors: Correlation, Equated Scores, Estimation (Mathematics), Goodness of Fit
Morrison, Carol A.; Fitzpatrick, Steven J. – 1992
An attempt was made to determine which item response theory (IRT) equating method results in the least amount of equating error or "scale drift" when equating scores across one or more test forms. An internal anchor test design was employed with five different test forms, each consisting of 30 items, 10 in common with the base test and 5…
Descriptors: Comparative Analysis, Computer Simulation, Equated Scores, Error of Measurement
Bogan, Evelyn Doody; Yen, Wendy M. – 1983
Four multidimensional data configurations and one unidimensional data configuration were simulated for three differences in mean difficulty between two tests to be equated. Two chi-square statistics, Q1 and Q2, were examined for their ability to detect multidimensionality. Results indicated that Q1 did not discriminate between any of the…
Descriptors: Difficulty Level, Equated Scores, Goodness of Fit, Latent Trait Theory
Pages: 1  |  ...  |  47  |  48  |  49  |  50  |  51  |  52  |  53  |  54  |  55  |  ...  |  75