Schantz, Susan L.; Gardiner, Joseph C.; Gasior, Donna M.; McCaffrey, Robert J.; Sweeney, Anne M.; Humphrey, Harold E. B. – Psychology in the Schools, 2004
D.V. Cicchetti, A.S. Kaufman, and S.S. Sparrow (this issue) use six criteria to evaluate the published findings from seven different studies of PCB exposure and neuropsychological function. They point out a number of weaknesses or flaws in each study and conclude that these weaknesses make the overall conclusion that PCB exposure negatively…
Descriptors: Evaluation Criteria, Prenatal Influences, Infants, Error of Measurement
Sudweeks, Richard R.; Glissmeyer, Connie B.; Morrison, Timothy G.; Wilcox, Bradley R.; Tanner, Mark W. – Reading Research and Instruction, 2004
Oral retellings are strongly recommended as a way to measure reading comprehension for second language learners (Bernhardt, 1985, 1990, 1991). However, the reliability of such ratings is a matter of concern for a variety of reasons (Aiken, 1996; Cooper, 1981; Saal, Downey, & Lahey, 1980). The purpose of this study was to establish reliable rating…
Descriptors: Error of Measurement, Generalizability Theory, Reading Comprehension, Second Language Learning
Hardt, Jochen; Rutter, Michael – Journal of Child Psychology and Psychiatry, 2004
Background: Influential studies have cast doubt on the validity of retrospective reports by adults of their own adverse experiences in childhood. Accordingly, many researchers view retrospective reports with scepticism. Method: A computer-based search, supplemented by hand searches, was used to identify studies reported between 1980 and 2001 in…
Descriptors: Evidence, Siblings, Sexual Abuse, Validity
New Mexico Public Education Department, 2007
The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview and the technical characteristics of the 2007 NMSBA. The 2007 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Summary of student performance; (4) Statistical analyses of item and…
Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring
Carter, Rufus Lynn – Research & Practice in Assessment, 2006
Many times in both educational and social science research it is impossible to collect data that is complete. When administering a survey, for example, people may answer some questions and not others. This missing data causes a problem for researchers using structural equation modeling (SEM) techniques for data analyses. Because SEM and…
Descriptors: Structural Equation Models, Error of Measurement, Data, Change Strategies
Abedi, Jamal – Teachers College Record, 2006
Assessments in English that are constructed for native English speakers may not provide valid inferences about the achievement of English language learners (ELLs). The linguistic complexity of the test items that are not related to the content of the assessment may increase the measurement error, thus reducing the reliability of the assessment.…
Descriptors: Second Language Learning, Test Items, Psychometrics, Inferences
Solano-Flores, Guillermo; Li, Min – Educational Measurement: Issues and Practice, 2006
We contend that generalizability (G) theory allows the design of psychometric approaches to testing English-language learners (ELLs) that are consistent with current thinking in linguistics. We used G theory to estimate the amount of measurement error due to code (language or dialect). Fourth- and fifth-grade ELLs, native speakers of…
Descriptors: Foreign Countries, Grade 4, Grade 5, English (Second Language)
Lee, Sik-Yum; Xia, Ye-Mao – Psychometrika, 2006
By means of more than a dozen user-friendly packages, structural equation models (SEMs) are widely used in behavioral, educational, social, and psychological research. As the underlying theory and methods in these packages are vulnerable to outliers and distributions with longer-than-normal tails, a fundamental problem in the field is the…
Descriptors: Maximum Likelihood Statistics, Statistical Distributions, Structural Equation Models, Robustness (Statistics)
Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008
With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…
Descriptors: Global Approach, Cultural Differences, Norms, Human Resources
Chang, Shun-Wen – Educational and Psychological Measurement, 2006
This study evaluates the effects of employing the linear, normalizing, and arcsine transformation methods for constructing scale scores on the Basic Competence Test (BCTEST). Tests in three subject areas (Chinese, English, and Mathematics) were studied using the data of test administrations from 2001 to 2003. The resulting scale scores for each…
Descriptors: Standardized Tests, Achievement Tests, Test Theory, True Scores
Johanson, George A. – 1992
Most educational measurement texts distinguish between norm-referenced (NR), or relative, methods of assigning letter grades to objective test scores, and criterion-referenced (CR), or absolute, methods. Both NR and CR approaches have serious limitations in typical classroom situations, and neither approach, in its pure form, may be entirely…
Descriptors: Criterion Referenced Tests, Cutting Scores, Educational Testing, Error of Measurement
Rachor, Robert E.; Cizek, Gregory J. – 1996
The gain, or difference, score is defined as the difference between the posttest score and the pretest score for an individual. Gain scores appear to be a natural measure of growth for education and the social sciences, but they contain two sources of measurement error, error in either the pretest or posttest scores, and cannot be considered…
Descriptors: Achievement Gains, Correlation, Educational Research, Elementary Secondary Education
Haertel, Edward H. – 1992
Classical test theory, item response theory, and generalizability theory all treat the abilities to be measured as continuous variables, and the items of a test as independent probes of underlying continua. These models are well-suited to measuring the broad, diffuse traits of traditional differential psychology, but not for measuring the outcomes…
Descriptors: Ability, Data Analysis, Error of Measurement, Generalizability Theory
Muthen, Bengt O.; Nelson, Ginger – 1992
It has been demonstrated that the individual variation in the level and rate of learning for a cohort of students over time can be estimated by hierarchical linear models. Models of this type can also be estimated using widely available structural modeling software, which provides a flexible framework for model explorations, including the use of…
Descriptors: Cohort Analysis, Computer Software, Elementary Secondary Education, Error of Measurement
Hedges, Larry V.; Vevea, Jack L. – 1997
This study investigates the amount of uncertainty added to National Assessment of Educational Progress (NAEP) estimates by equating error under both ideal and less than ideal circumstances. Data from past administrations are used to guide simulations of various equating designs and error due to equating is estimated empirically. The design…
Descriptors: Ability, Elementary Secondary Education, Equated Scores, Error of Measurement