Showing 451 to 465 of 1,113 results
Peer reviewed
Ross, John A.; Gray, Peter – Alberta Journal of Educational Research, 2008
We examined how much agreement there was between scores from large-scale mandated assessments and report-card grades for 14,776 students in grades 3, 6, and 9 of a district in which conditions were conducive to alignment of assessments. We found significant mean differences between internal and external assessments: effect sizes were 0.29 to 0.63…
Descriptors: Student Evaluation, Grades (Scholastic), Measures (Individuals), Effect Size
Nering, Michael L., Ed.; Ostini, Remo, Ed. – Routledge, Taylor & Francis Group, 2010
This comprehensive "Handbook" focuses on the most used polytomous item response theory (IRT) models. These models help us understand the interaction between examinees and test questions where the questions have various response categories. The book reviews all of the major models and includes discussions about how and where the models…
Descriptors: Guides, Item Response Theory, Test Items, Correlation
Tanguma, Jesus – 2000
This paper describes four commonly used designs in equating test scores. These designs are: (1) single-group; (2) random-group; (3) equivalent-group; and (4) anchor-test. Each design requires that its data be collected according to specific guidelines. Three of the four methods are illustrated through hypothetical examples. All four methods try to…
Descriptors: Equated Scores, Test Format
von Davier, Alina A.; Holland, Paul W.; Thayer, Dorothy – 2002
The Non-Equivalent-groups Anchor Test (NEAT) design involves two populations, "P" and "Q," of test takers and makes use of an anchor test to link them. Two observed-score equating methods used for NEAT designs are those based on chain equating and those using the anchor to poststratify the distributions of the two operational…
Descriptors: Equated Scores, Statistical Analysis
Chu, Kwang-lee; Kamata, Akihito – 2000
The quality of nonequivalent group equating by the one-parameter hierarchical generalized linear logistic model (1-P HGLLM) was examined by comparing it with: (1) traditional concurrent equating; (2) Stocking-Lord's method; and (3) multiple-group concurrent equating. Root mean squared errors (RMSEs) for item parameters indicated that there was no…
Descriptors: Equated Scores, Groups, Models
Keats, John B. – Educational and Psychological Measurement, 1970
Descriptors: Computer Programs, Equated Scores
Walters, Allison M. – ProQuest LLC, 2009
Four-year colleges and universities submit faculty teaching load and instructional cost data annually to the Delaware Study of Instructional Costs and Productivity. While the Delaware Study currently adjusts the calculation of annual FTE students to account for the difference in annual student credit hours (SCH) earned by students at semester and…
Descriptors: Semester System, Full Time Equivalency, Teaching Load, Doctoral Dissertations
McGlynn, Angela Provitera – Education Digest: Essential Readings Condensed for Quick Review, 2008
A new report, "The Proficiency Illusion," released last year by the Thomas B. Fordham Institute states that the tests that states use to measure academic progress under the No Child Left Behind Act (NCLB) are creating a false impression of success, especially in reading and especially in the early grades. The report is a collaboration…
Descriptors: Federal Legislation, Academic Achievement, Rating Scales, Achievement Tests
Peer reviewed
Saida, Chisato; Hattori, Tamaki – Language Testing, 2008
Despite growing concerns about declining scholastic abilities of Japanese students throughout Japan prior to the implementation of the revised Courses of Study in 2002, little empirical evidence was available at that time to support this perceived decline in academic performance. This research describes post-hoc IRT equating of previously…
Descriptors: Language Tests, Measures (Individuals), Foreign Countries, Item Response Theory
Peer reviewed
von Davier, Alina A.; Wilson, Christine – Educational and Psychological Measurement, 2007
This article discusses the assumptions required by the item response theory (IRT) true-score equating method (with the Stocking & Lord, 1983, scaling approach), which is commonly used in the nonequivalent groups with an anchor data-collection design. More precisely, this article investigates the assumptions made at each step by the IRT approach to…
Descriptors: Calculus, Item Response Theory, Scores, Data Collection
Peer reviewed
Liu, Jinghua; Zhu, Xiaowen – ETS Research Report Series, 2008
The purpose of this paper is to explore methods to approximate population invariance without conducting multiple linkings for subpopulations. Under the single group or equivalent groups design, no linking needs to be performed for the parallel-linear system linking functions. The unequated raw score information can be used as an approximation. For…
Descriptors: Raw Scores, Test Format, Comparative Analysis, Test Construction
Peer reviewed
Yi, Qing; Harris, Deborah J.; Gao, Xiaohong – Applied Psychological Measurement, 2008
This study investigated the group invariance of equating results using a science achievement test. Examinees were divided into different subgroups based on the average composite score for test centers, whether they had taken a physics course, and self-reported science grade point average. The reason for dividing examinees into subgroups using such…
Descriptors: Grade Point Average, Science Achievement, Academic Achievement, Physics
Felan, George D. – 2002
This paper discusses the four major types of test equating: (1) mean; (2) linear; (3) equipercentile; and (4) item response theory. The single-group, equivalent-group, and anchor-test data collection designs are presented as methods used for test equating. Issues related to assumptions and equating error are also addressed. The advantages and…
Descriptors: Equated Scores, Item Response Theory
Peer reviewed
Li, Yuan H.; Lissitz, Robert W. – Applied Psychological Measurement, 2000
Evaluated three types of multidimensional item response theory (MIRT) linking methods through two simulation studies. Results indicate that the best MIRT linking method was an unbiased, effective, and consistent estimator that produced accurate estimates of transformation parameters when errors in estimation of item parameters were manipulated…
Descriptors: Equated Scores, Estimation (Mathematics), Simulation
Peer reviewed
Oshima, T. C.; Davey, T. C.; Lee, K. – Journal of Educational Measurement, 2000
Evaluated multidimensional linking procedures based on a framework recently proposed by T. Davey, T. Oshima, and K. Lee (1996): (1) the Direct method; (2) the Equated Function method; (3) the Test Characteristic Function method; and (4) the Item Characteristic Function method. Simulation results indicate advantages to the last two methods, but all…
Descriptors: Equated Scores, Simulation, Test Items