Publication Date
In 2025 | 39 |
Since 2024 | 192 |
Since 2021 (last 5 years) | 495 |
Since 2016 (last 10 years) | 996 |
Since 2006 (last 20 years) | 2028 |
Descriptor
Error of Measurement | 3295 |
Statistical Analysis | 599 |
Scores | 504 |
Item Response Theory | 445 |
Correlation | 434 |
Comparative Analysis | 422 |
Foreign Countries | 415 |
Test Reliability | 408 |
Computation | 404 |
Simulation | 370 |
Reliability | 355 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 93 |
Practitioners | 23 |
Teachers | 22 |
Policymakers | 10 |
Administrators | 5 |
Students | 4 |
Counselors | 2 |
Parents | 2 |
Community | 1 |
Location
United States | 47 |
Germany | 42 |
Australia | 34 |
Canada | 27 |
Turkey | 27 |
California | 22 |
United Kingdom (England) | 20 |
Netherlands | 18 |
China | 16 |
New York | 15 |
United Kingdom | 15 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |

Hudgins, R. R.; Reilly, P. M. – Chemical Engineering Education, 1989
Discussed are problems encountered when a gas absorption experiment with strong measurement error is used. Notes students either avoid the experiment or report it as defective. Provides ideas to make lab experiments more instructive. (MVL)
Descriptors: Chemical Analysis, Chemical Engineering, Chemistry, College Science

Young, John W. – Journal of Research in Education, 1992
Uses the general linear model to develop an adjusted cumulative grade point average (GPA) that systematically models grading effects among courses. A validation study using 778 courses of 1,564 Stanford (California) University students shows an increase in predictability of the adjusted least-squares GPA over the unadjusted GPA. (SLD)
Descriptors: Academic Achievement, Admission (School), Course Selection (Students), Error of Measurement

Press, S. James; Tanur, Judith M. – Evaluation Review, 1991
Relevance of the intersection of sociology, statistics, and public policy to the study of quality control in three family assistance programs--food stamps, Aid to Families with Dependent Children (AFDC), and Medicaid--is reviewed using a study by the National Academy of Sciences of methods for improving quality control systems. (SLD)
Descriptors: Error of Measurement, Estimation (Mathematics), Federal Aid, Federal Programs
Lilley, M.; Barker, T.; Britton, C. – Computers and Education, 2004
This paper presents ongoing research at the University of Hertfordshire on the use of computer-adaptive tests (CATs) in Higher Education. A software prototype based on Item Response Theory has been developed and is described here. This application was designed to estimate the level of proficiency in English for those students whose first language…
Descriptors: Foreign Countries, Adaptive Testing, Computer Assisted Testing, Computer Software Evaluation
Kane, Thomas J.; Staiger, Douglas O. – Brookings Papers on Education Policy, 2002
By the spring of 2000, forty states had begun using student test scores to rate school performance. Twenty states have gone a step further and are attaching explicit monetary rewards or sanctions to a school's test performance. In this paper, the authors focus on accountability programs in which states measure the effectiveness of individual…
Descriptors: Elementary Schools, Accountability, Scores, Risk
Gorsuch, Greta – CALICO Journal, 2004
In this study, retrospective interviews were used to investigate reliability (and thus validity) threats to a computerized ESL listening comprehension test administered at a university in the US. The participants in the investigation, six international graduate students, were asked to respond to semi- and open-ended questions during individual…
Descriptors: Graduate Students, Listening Comprehension, Investigations, Listening Comprehension Tests
Takalkar, Pradnya; And Others – 1993
This study compared 4,594 student responses from three different surveys of incoming students at the University of South Florida (USF) with data from Florida's State University System (SUS) admissions files to determine what proportion of error occurs in the survey responses. Specifically, the study investigated the amount of measurement error in…
Descriptors: College Admission, College Applicants, College Bound Students, Comparative Analysis
Spencer, Bruce D. – 1986
The National Assessment of Educational Progress (NAEP) currently tests seventeen-year-old students enrolled in public and private secondary schools, but it does not test "out-of-school" seventeen-year-olds who have either graduated or dropped out. Estimating that one of five seventeen-year-olds is out of school, the interpretability of…
Descriptors: Adolescents, Cohort Analysis, Dropouts, Educational Assessment
Angoff, William H.; Cowell, William R. – 1985
Linear and equipercentile equating conversions were developed for two forms of the Graduate Record Examinations (GRE) quantitative test and the verbal-plus-quantitative test. From a very large sample of students taking the GRE in October 1981, subpopulations were selected with respect to race, sex, field of study, and level of performance (defined…
Descriptors: Aptitude Tests, College Entrance Examinations, Equated Scores, Error of Measurement
Rodgers, Willard L.; Bachman, Jerald G. – 1986
This paper explores various procedures of panel data in the estimation of causal models. The reported analyses are from the Monitoring the Future study, a nationwide questionnaire survey of 16,000 to 17,000 high school seniors conducted annually since 1975. First, the parameters of causal models are estimated in which the dependent variables are…
Descriptors: Attitude Measures, Attribution Theory, Comparative Analysis, Drug Use
Cook, Linda L.; Petersen, Nancy S. – 1986
This paper examines how various equating methods are affected by: (1) sampling error; (2) sample characteristics; and (3) characteristics of anchor test items. It reviews empirical studies that investigated the invariance of equating transformations, and it discusses empirical and simulation studies that focus on how the properties of anchor tests…
Descriptors: Educational Research, Equated Scores, Error of Measurement, Evaluation Methods
Hummel, Thomas J.; Johnston, Charles B. – 1986
This study investigated seven methods for analyzing multivariate group differences. Bonferroni t statistics, multivariate analysis of variance (MANOVA) followed by analysis of variance (ANOVA), and five other methods were studied using Monte Carlo methods. Methods were compared with respect to (1) experimentwise error rate; (2) power; (3) number…
Descriptors: Analysis of Variance, Comparative Analysis, Correlation, Differences
Ridgeway, Gretchen Freiheit – 1982
A one-parameter latent trait model was the basis of the test development procedures in the Basic Skills Assessment Program (BSAP) of the Department of Defense Dependents Schools (DoDDS). Several issues are involved in applying the Rasch model to an assessment program in a large school district. Separate sets of skills continua are arranged by…
Descriptors: Achievement Tests, Basic Skills, Dependents Schools, Difficulty Level
Hendrickson, Leslie; Jones, Barnie – 1982
The logic of using a gain score approach versus longitudinal causal models is studied in this secondary analysis of a complex data base. The gain score model used by the Federal Reserve Bank and the School District of Philadelphia in their "What Works in Reading?" study is successively refined using the LISREL structural equation…
Descriptors: Achievement Gains, Achievement Tests, Data Analysis, Elementary Education
Gustafsson, Jan-Eric – 1977
The Rasch model for test analysis is described and compared with two-parameter and three-parameter latent-trait models. Conditional maximum likelihood equations for estimating item parameters are derived, and estimates of person parameters are described together with their confidence intervals. Goodness of fit tests are discussed, including a…
Descriptors: Adaptive Testing, Computer Programs, Equated Scores, Error of Measurement