Publication Date
In 2025 | 0 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 6 |
Descriptor
Error of Measurement | 40 |
Test Interpretation | 40 |
Test Reliability | 40 |
Scores | 14 |
Criterion Referenced Tests | 11 |
Norm Referenced Tests | 11 |
Testing Problems | 11 |
Test Construction | 10 |
Statistical Analysis | 9 |
Test Validity | 9 |
Measurement Techniques | 8 |
More ▼ |
Source
Author
Publication Type
Education Level
Higher Education | 2 |
Elementary Secondary Education | 1 |
Postsecondary Education | 1 |
Audience
Researchers | 4 |
Practitioners | 1 |
Location
Canada | 1 |
Malaysia | 1 |
Netherlands | 1 |
New Zealand | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Wechsler Adult Intelligence… | 3 |
ACT Assessment | 1 |
Metropolitan Achievement Tests | 1 |
New Jersey College Basic… | 1 |
What Works Clearinghouse Rating
Danielle R. Blazek; Jason T. Siegel – International Journal of Social Research Methodology, 2024
Social scientists have long agreed that satisficing behavior increases error and reduces the validity of survey data. There have been numerous reviews on detecting satisficing behavior, but preventing this behavior has received less attention. The current narrative review provides empirically supported guidance on preventing satisficing by…
Descriptors: Response Style (Tests), Responses, Reaction Time, Test Interpretation
Viola Merhof; Caroline M. Böhm; Thorsten Meiser – Educational and Psychological Measurement, 2024
Item response tree (IRTree) models are a flexible framework to control self-reported trait measurements for response styles. To this end, IRTree models decompose the responses to rating items into sub-decisions, which are assumed to be made on the basis of either the trait being measured or a response style, whereby the effects of such person…
Descriptors: Item Response Theory, Test Interpretation, Test Reliability, Test Validity
Zhong Jian Chee; Anke M. Scheeren; Marieke de Vries – Autism: The International Journal of Research and Practice, 2024
Despite several psychometric advantages over the 50-item Autism Spectrum Quotient, an instrument used to measure autistic traits, the abridged AQ-28 and its cross-cultural validity have not been examined as extensively. Therefore, this study aimed to examine the factor structure and measurement invariance of the AQ-28 in 818 Dutch (M[subscript…
Descriptors: Autism Spectrum Disorders, Questionnaires, Factor Structure, Factor Analysis
Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016
ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…
Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement
Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012
Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…
Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries

Knight, Robert G. – Journal of Consulting and Clinical Psychology, 1983
Discusses the significance of confidence intervals around IQ scores based on a misleading interpretation of the standard error of measurement terms provided in the Wechsler Adult Intelligence Scale-Revised (WAIS-R) manual. Presents standard error values and a table for determining the abnormality of verbal and performance IQ discrepancies.…
Descriptors: Error of Measurement, Foreign Countries, Intelligence Tests, Test Interpretation

Cureton, Edward E.; And Others – Educational and Psychological Measurement, 1973
Study based on F. M. Lord's arguments in 1957 and 1959 that tests of the same length do have the same standard error of measurement. (CB)
Descriptors: Error of Measurement, Statistical Analysis, Test Interpretation, Test Length

Huynh, Huynh – Psychometrika, 1980
A procedure for estimating the rates of false positive and false negative classification in a mastery testing situation is described. Formulas and tables are described for the computations of the standard errors. (Author/JKS)
Descriptors: Cutting Scores, Error of Measurement, Mastery Tests, Screening Tests
Simpson, J. D. – Audio-Visual Language Journal, 1974
Some basic statistical concepts relevant to the teacher--mean scores, standard deviation, normal and skewed distributions, z scores, item analysis, standard error of measurement, reliability--and their use by the teacher are explained. (RM)
Descriptors: Error of Measurement, Evaluation Methods, Norm Referenced Tests, Scoring
Livingston, Samuel A. – 1976
A distinction is made between reliability of measurement and reliability of classification; the "criterion-referenced reliability coefficient" describes the former. Application of this coefficient to the probability distribution of possible scores for a single student yields a meaningful way to describe the reliability of a single score. (Author)
Descriptors: Classification, Criterion Referenced Tests, Error of Measurement, Measurement

Whitely, Susan E. – Journal of Educational Measurement, 1977
A debate concerning specific issues and the general usefulness of the Rasch latent trait test model is continued. Methods of estimation, necessary sample size, and the applicability of the model are discussed. (JKS)
Descriptors: Error of Measurement, Item Analysis, Mathematical Models, Measurement

Ryan, Joseph J.; And Others – Journal of Consulting and Clinical Psychology, 1983
Wechsler Adult Intelligence Scale-Revised protocols from two vocational counseling clients were scored by 19 psychologists and 20 graduate students. Regardless of scorer's experience level, mechanical scoring error produced summary scores varying by as much as 4 to 18 IQ points. (Author/RC)
Descriptors: Error of Measurement, Graduate Students, Higher Education, Intelligence Tests

Brown, Jonathan R. – Language, Speech, and Hearing Services in Schools, 1989
The importance of using the standard error of measurement (SEm) in determining reliability in test scores is emphasized. The SEm is compared to the hypothetical true score for standardized tests, and procedures for calculation of the SEm are explained. (JDD)
Descriptors: Elementary Secondary Education, Error of Measurement, Scores, Standardized Tests

Wright, Benjamin D. – Journal of Educational Measurement, 1977
Statements made in a previous article of this journal concerning the Rasch latent trait test model are questioned. Methods of estimation, necessary sample sizes, several formuli, and the general usefulness of the Rasch model are discussed. (JKS)
Descriptors: Computers, Error of Measurement, Item Analysis, Mathematical Models

Brennan, Robert L.; Kane, Michael T. – Psychometrika, 1977
Using the assumption of randomly parallel tests and concepts from generalizability theory, three signal/noise ratios for domain-referenced tests are developed, discussed, and compared. The three ratios have the same noise but different signals depending upon the kind of decision to be made as a result of measurement. (Author/JKS)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Error of Measurement, Mathematical Models