Braun, Henry; Qian, Jiahe – ETS Research Report Series, 2008
This report describes the derivation and evaluation of a method for comparing the performance standards for public school students set by different states. It is based on an approach proposed by McLaughlin and associates, which constituted an innovative attempt to resolve the confusion and concern that occurs when very different proportions of…
Descriptors: State Standards, Comparative Analysis, Public Schools, National Competency Tests
Alonzo, Julie; Liu, Kimy; Tindal, Gerald – Behavioral Research and Teaching, 2008
This technical report describes the development of reading comprehension assessments designed for use as progress monitoring measures appropriate for 2nd Grade students. The creation, piloting, and technical adequacy of the measures are presented. The following are appended: (1) Item Specifications for MC [Multiple Choice] Comprehension - Passage…
Descriptors: Reading Comprehension, Reading Tests, Grade 2, Elementary School Students
Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2008
This technical report describes the development and piloting of reading comprehension measures developed for use by fifth-grade students as part of an online progress monitoring assessment system, http://easycbm.com. Each comprehension measure is comprised of an original work of narrative fiction approximately 1500 words in length followed by 20…
Descriptors: Reading Comprehension, Reading Tests, Grade 5, Multiple Choice Tests
Kluge, Annette – Applied Psychological Measurement, 2008
The use of microworlds (MWs), or complex dynamic systems, in educational testing and personnel selection is hampered by systematic measurement errors because these new and innovative item formats are not adequately controlled for their difficulty. This empirical study introduces a way to operationalize an MW's difficulty and demonstrates the…
Descriptors: Personnel Selection, Self Efficacy, Educational Testing, Computer Uses in Education
von Davier, Alina A.; Holland, Paul W.; Livingston, Samuel A.; Casabianca, Jodi; Grant, Mary C.; Martin, Kathleen – ETS Research Report Series, 2006
This study examines how closely the kernel equating (KE) method (von Davier, Holland, & Thayer, 2004a) approximates the results of other observed-score equating methods--equipercentile and linear equatings. The study used pseudotests constructed of item responses from a real test to simulate three equating designs: an equivalent groups (EG)…
Descriptors: Equated Scores, Statistical Analysis, Simulation, Tests
Zhang, Yanwei; Breithaupt, Krista; Tessema, Aster; Chuah, David – Online Submission, 2006
Two IRT-based procedures to estimate test reliability for a certification exam that used both adaptive (via an MST model) and non-adaptive design were considered in this study. Both procedures rely on calibrated item parameters to estimate error variance. In terms of score variance, one procedure (Method 1) uses the empirical ability distribution…
Descriptors: Individual Testing, Test Reliability, Programming, Error of Measurement
Gonzalez-Roma, Vicente; Hernandez, Ana; Gomez-Benito, Juana – Multivariate Behavioral Research, 2006
In this simulation study, we investigate the power and Type I error rate of a procedure based on the mean and covariance structure analysis (MACS) model in detecting differential item functioning (DIF) of graded response items with five response categories. The following factors were manipulated: type of DIF (uniform and non-uniform), DIF…
Descriptors: Multivariate Analysis, Item Response Theory, Test Bias, Sample Size
Sass, Daniel A.; Smith, Philip L. – Structural Equation Modeling: A Multidisciplinary Journal, 2006
Structural equation modeling allows several methods of estimating the disattenuated association between 2 or more latent variables (i.e., the measurement model). In one common approach, measurement models are specified using item parcels as indicators of latent constructs. Item parcels versus original items are often used as indicators in these…
Descriptors: Structural Equation Models, Item Analysis, Error of Measurement, Measures (Individuals)
Aguinis, Herman; Pierce, Charles A. – Applied Psychological Measurement, 2006
The computation and reporting of effect size estimates is becoming the norm in many journals in psychology and related disciplines. Despite the increased importance of effect sizes, researchers may not report them or may report inaccurate values because of a lack of appropriate computational tools. For instance, Pierce, Block, and Aguinis (2004)…
Descriptors: Effect Size, Multiple Regression Analysis, Predictor Variables, Error of Measurement
Meyers, Jason L.; Beretvas, S. Natasha – Multivariate Behavioral Research, 2006
Cross-classified random effects modeling (CCREM) is used to model multilevel data from nonhierarchical contexts. These models are widely discussed but infrequently used in social science research. Because little research exists assessing when it is necessary to use CCREM, 2 studies were conducted. A real data set with a cross-classified structure…
Descriptors: Social Science Research, Computation, Models, Data Analysis
Linacre, John Michael – 1995
Various methods of estimating main effects from ordinal data are presented and contrasted. Problems discussed include: (1) at what level to accumulate ordinal data into linear measures; (2) how to maintain scaling across analyses; and (3) the inevitable confounding of within cell variance with measurement error. An example shows three methods of…
Descriptors: Analysis of Variance, Demography, Error of Measurement, Estimation (Mathematics)
Barnette, J. Jackson; McLean, James E. – 1998
Tukey's Honestly Significant Difference (HSD) procedure (J. Tukey, 1953) is probably the most recommended and used procedure for controlling Type I error rate when making multiple pairwise comparisons as follow-ups to a significant omnibus F test. This study compared observed Type I errors with nominal alphas of 0.01, 0.05, and 0.10 compared for…
Descriptors: Comparative Analysis, Error of Measurement, Monte Carlo Methods, Research Methodology
Jarrell, Michele Glankler – 1992
This repeated measures factorial design study compared the results of two procedures for identifying multivariate outliers under varying conditions, the Mahalanobis distance and the Andrews-Pregibon statistic. Results were analyzed for the total number of outliers identified and number of false outliers identified. Simulated data were limited to…
Descriptors: Comparative Analysis, Computer Simulation, Error of Measurement, Mathematical Models
Kish, Leslie – 1989
A brief, practical overview of "design effects" (DEFFs) is presented for users of the results of sample surveys. The overview is intended to help such users to determine how and when to use DEFFs and to compute them correctly. DEFFs are needed only for inferential statistics, not for descriptive statistics. When the selections for…
Descriptors: Computer Software, Error of Measurement, Mathematical Models, Research Design
Alderman, Donald L. – 1981
This study applies a procedure which yields estimates of true score change on the Scholastic Aptitude Test (SAT) adjusted for regression effects and student self-selection. It is shown that student self-selection in deciding to repeat an admissions test probably involves factors in addition to the measurement error attributable to variations in…
Descriptors: College Entrance Examinations, Error of Measurement, Regression (Statistics), Scores

