Publication Date
In 2025 | 4 |
Since 2024 | 9 |
Since 2021 (last 5 years) | 58 |
Since 2016 (last 10 years) | 147 |
Since 2006 (last 20 years) | 496 |
Descriptor
Source
Author
Bianchini, John C. | 35 |
von Davier, Alina A. | 34 |
Dorans, Neil J. | 33 |
Kolen, Michael J. | 31 |
Loret, Peter G. | 31 |
Kim, Sooyeon | 26 |
Moses, Tim | 24 |
Livingston, Samuel A. | 22 |
Holland, Paul W. | 20 |
Puhan, Gautam | 20 |
Liu, Jinghua | 19 |
More ▼ |
Publication Type
Education Level
Location
Canada | 9 |
Australia | 8 |
Florida | 8 |
United Kingdom (England) | 8 |
Netherlands | 7 |
New York | 7 |
United States | 7 |
Israel | 6 |
Turkey | 6 |
United Kingdom | 6 |
California | 5 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 12 |
No Child Left Behind Act 2001 | 5 |
Education Consolidation… | 3 |
Hawkins Stafford Act 1988 | 1 |
Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Rasp, Alfred Jr. – 1976
This paper focuses on three topics. The first introduces the original Anchor Test Study conducted and reported by Educational Testing Service (ETS) from 1971 to 1974. This study, involving the testing of more than 300,000 children, produced raw score equivalency tables for eight commonly used reading tests and new individual and school-mean norms…
Descriptors: Educational Assessment, Elementary Education, Equated Scores, Grade 4
Texas Education Agency, Austin. – 1998
This digest is designed to provide information to Texas testing coordinators, other educators, and interested citizens about the development procedures and technical attributes of the state-mandated criterion-referenced assessment program. The chapters are: (1) "Background"; (2) "Test Development"; (3) "Test…
Descriptors: Alternative Assessment, Criterion Referenced Tests, Elementary Secondary Education, Equated Scores

Norcini, John J. – Journal of Educational Measurement, 1990
Whether cutting score equivalents (CSEs) based on examinee performance are the same as CSEs based on expert judgment was examined using data from 3,262 examinees taking an internal medicine certification examination. CSEs produced by 40 physicians/experts were closer to the criteria than were standards derived from examinee performance. (SLD)
Descriptors: Certification, Comparative Analysis, Cutting Scores, Decision Making
Li, Yuan H.; Lissitz, Robert W. – Journal of Educational Measurement, 2004
The analytically derived asymptotic standard errors (SEs) of maximum likelihood (ML) item estimates can be approximated by a mathematical function without examinees' responses to test items, and the empirically determined SEs of marginal maximum likelihood estimation (MMLE)/Bayesian item estimates can be obtained when the same set of items is…
Descriptors: Test Items, Computation, Item Response Theory, Error of Measurement

Sireci, Stephen G. – 1995
This paper presents and evaluates a new procedure to classify examinees into meaningful categories based on their responses to (performance on) items comprising a test instrument. The proposed procedure uses the data analytic technique of cluster analysis to partition the examinee population into homogeneous groupings. This procedure was applied…
Descriptors: Classification, Cluster Analysis, Equated Scores, High School Equivalency Programs
Center for Research on Evaluation, Standards, and Student Testing, Los Angeles, CA. – 1988
Three papers and a sample outline of Requests for Proposals (RFPs) represent the work of the Monitoring and Improving Testing and Evaluation Innovations (MITEI) Project during 1988. The skeleton of a sample RFP outline for large-scale assessment was expanded to provide a checklist of the type of information that should be included in a RFP. The…
Descriptors: Agency Role, Check Lists, Construct Validity, Educational Assessment
Murray, Steve – 1988
The gap-reduction model has been identified as a potential alternative to or extension of the Title I Evaluation and Reporting System (TIERS). The gap-reduction model has been recommended for the evaluation of bilingual programs, but has only recently been given consideration for evaluating local Chapter 1 programs. This report recommends that…
Descriptors: Compensatory Education, Correlation, Educational Assessment, Elementary Secondary Education
Beaton, Albert E. – 1985
This paper overviews technical developments in data analysis procedures for the National Assessment of Educational Progress (NAEP) reading data during 1984. The highlight of the reshaping of the NAEP data has been the scaling using item response theory (IRT). AT this point in the data analysis, an IRT-based scale appears appropriate for reading…
Descriptors: Educational Assessment, Elementary Secondary Education, Equated Scores, Item Analysis
Gialluca, Kathleen A.; And Others – 1984
In this study, simulated and actual Air Force test data were used to compare the different procedures for equating mental tests: conventional (equipercentile and linear), Item Response Theory (IRT), and strong true-score theory (STST); data collection designs used were single-group, equivalent-groups, and anchor test. Equating transformations were…
Descriptors: Adults, Cognitive Ability, Cognitive Tests, Comparative Analysis
Ligon, Glynn; Ellis, John – 1986
For Texas's Career Ladder System of rewarding good teachers, teachers' performance evaluations from 1981 to 1984 were used to rank teachers in the Austin Independent School District. Significant biases were noted between raters, between years, and between elementary and secondary teacher ratings. To adjust for these biases, each teacher's raw…
Descriptors: Bias, Career Ladders, Elementary Secondary Education, Equated Scores
Steele, Joe M. – 1986
Assumptions have been made about the growth and development of reasoning and communicating skills in college. This study describes a standardized set of assessments of reasoning, writing and speaking with norms for freshmen and seniors and multiple equated forms. The study provides evidence of the validity and reliability of these assessments.…
Descriptors: Abstract Reasoning, Cognitive Processes, Cognitive Tests, College Freshmen
Samejima, Fumiko – 1984
The use of a three-parameter logistic model for applying latent trait theory has become more popular because of the availability of computer programs. The program Logist 5 can be used not only for the item parameter estimation in the three-parameter logistic model, but also in the two parameter logistic model by setting the third parameter equal…
Descriptors: Correlation, Elementary Secondary Education, Equated Scores, Estimation (Mathematics)
Eisenberg, Eric M.; Book, Cassandra L. – 1980
Guidelines are described for setting up an item bank under latent trait theory which may be applied to the achievement testing system of multi-section, large-enrollment, college survey courses. The enrollment for the course is typically heterogeneous: students may be majors or non-majors, any one section may contain honors college students and…
Descriptors: Achievement Tests, Course Content, Equated Scores, Goodness of Fit
Goulet, Larry R.; And Others – 1975
The problems and issues involved in the conduct of educational-developmental research are examined within the perspective of longitudinal research methodology. Chapters 2 and 3 examine contemporary research designs and procedures implemented for the selection of subjects and testing of behavior over time. Particular attention is given to the…
Descriptors: Behavior Change, Cognitive Development, Educational Research, Equated Scores
Bianchini, John C.; Loret, Peter G. – 1974
The Anchor Test Study provides a method for translating a pupil's score on any one of eight widely used standardized reading tests for Grades 4, 5, and 6 to a corresponding score of any of the other seven tests, as well as furnishing new nationally representative norms for each of the eight tests. In addition, the Study presents new estimates of…
Descriptors: Elementary School Students, Equated Scores, Grade 4, Grade 5