Publication Date
In 2025 | 2 |
Since 2024 | 8 |
Since 2021 (last 5 years) | 11 |
Since 2016 (last 10 years) | 19 |
Since 2006 (last 20 years) | 46 |
Descriptor
Test Interpretation | 188 |
Test Reliability | 188 |
Test Validity | 116 |
Testing | 50 |
Test Reviews | 44 |
Standardized Tests | 40 |
Test Format | 37 |
Elementary Secondary Education | 34 |
Scores | 31 |
Screening Tests | 31 |
Test Content | 29 |
Author
Reynolds, Cecil R. | 3 |
Bracken, Bruce A. | 2 |
Frisbie, David A. | 2 |
Smith, Douglas K. | 2 |
Wainer, Howard | 2 |
Alfonso, Vincent C. | 1 |
Aloisi, Cesare | 1 |
Amit Sevak | 1 |
Anastasi, Anne | 1 |
Andrews, Jac J. W. | 1 |
Angeles F. Estévez | 1 |
Education Level
Elementary Secondary Education | 6 |
Elementary Education | 3 |
Higher Education | 3 |
Postsecondary Education | 2 |
Adult Education | 1 |
Grade 1 | 1 |
Grade 9 | 1 |
Kindergarten | 1 |
Audience
Practitioners | 12 |
Researchers | 8 |
Teachers | 4 |
Counselors | 1 |
Location
United Kingdom | 2 |
Australia | 1 |
Canada | 1 |
Greece | 1 |
Malaysia | 1 |
Netherlands | 1 |
New York (New York) | 1 |
New Zealand | 1 |
Spain | 1 |
Sweden | 1 |
United Kingdom (England) | 1 |
Laws, Policies, & Programs
Education for All Handicapped… | 1 |
No Child Left Behind Act 2001 | 1 |
Kent Anderson Seidel – School Leadership Review, 2025
This paper examines one of three central diagnostic tools of the Concerns Based Adoption Model, the Stages of Concern Questionnaire (SoCQ). The SoCQ was developed with a focus on K12 education. It has been used widely since its development in 1973, in early childhood, higher education, medical, business, community, and military settings. The SoCQ…
Descriptors: Questionnaires, Educational Change, Educational Innovation, Intervention
Danielle R. Blazek; Jason T. Siegel – International Journal of Social Research Methodology, 2024
Social scientists have long agreed that satisficing behavior increases error and reduces the validity of survey data. There have been numerous reviews on detecting satisficing behavior, but preventing this behavior has received less attention. The current narrative review provides empirically supported guidance on preventing satisficing by…
Descriptors: Response Style (Tests), Responses, Reaction Time, Test Interpretation
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Farmer, Ryan L.; Kim, Samuel Y. – Psychology in the Schools, 2020
Many prominent intelligence tests (e.g., Wechsler Intelligence Scale for Children, Fifth Edition [WISC-V] and Reynolds Intellectual Abilities Scale, Second Edition [RIAS-2]) offer methods for computing subtest- and composite-level difference scores. This study uses data provided in the technical manual of the WISC-V and RIAS-2 to calculate…
Descriptors: Children, Intelligence Tests, Scores, Test Reliability
Marta Godoy-Giménez; Ángel García-Pérez; Fernando Cañadas; Angeles F. Estévez; Pablo Sayans-Jiménez – Autism: The International Journal of Research and Practice, 2024
The broad autism phenotype is the phenotypic expression of the primary characteristics of autism. However, currently available tests do not agree with the two-domain operationalization of broad autism phenotype or autism, and their internal structure has shown instability across applications. This study presents the Broad Autism…
Descriptors: Autism Spectrum Disorders, Genetics, Diagnostic Tests, Foreign Countries
Viola Merhof; Caroline M. Böhm; Thorsten Meiser – Educational and Psychological Measurement, 2024
Item response tree (IRTree) models are a flexible framework to control self-reported trait measurements for response styles. To this end, IRTree models decompose the responses to rating items into sub-decisions, which are assumed to be made on the basis of either the trait being measured or a response style, whereby the effects of such person…
Descriptors: Item Response Theory, Test Interpretation, Test Reliability, Test Validity
Eirini M. Mitropoulou; Leonidas A. Zampetakis; Ioannis Tsaousis – Evaluation Review, 2024
Unfolding item response theory (IRT) models are important alternatives to dominance IRT models in describing the response processes on self-report tests. Their usage is common in personality measures, since they indicate potential differentiations in test score interpretation. This paper aims to gain a better insight into the structure of trait…
Descriptors: Foreign Countries, Adults, Item Response Theory, Personality Traits
Lestari, Santi B.; Brunfaut, Tineke – Language Testing, 2023
Assessing integrated reading-into-writing task performances is known to be challenging, and analytic rating scales have been found to better facilitate the scoring of these performances than other common types of rating scales. However, little is known about how specific operationalizations of the reading-into-writing construct in analytic rating…
Descriptors: Reading Writing Relationship, Writing Tests, Rating Scales, Writing Processes
Zhong Jian Chee; Anke M. Scheeren; Marieke de Vries – Autism: The International Journal of Research and Practice, 2024
Despite several psychometric advantages over the 50-item Autism Spectrum Quotient, an instrument used to measure autistic traits, the abridged AQ-28 and its cross-cultural validity have not been examined as extensively. Therefore, this study aimed to examine the factor structure and measurement invariance of the AQ-28 in 818 Dutch (M[subscript…
Descriptors: Autism Spectrum Disorders, Questionnaires, Factor Structure, Factor Analysis
Villarreal, Victor; Sullivan, Jeremy; Hechler, Joseph M.; Ruiz, Karen – Journal of Applied School Psychology, 2021
Assessment of functional impairment provides information that is complementary to diagnostic criteria information and is critical for identifying targets for intervention and evaluating treatment outcomes. This review presents summative psychometric information for five multidimensional measures of functional impairment developed for use with…
Descriptors: Psychometrics, Psychological Evaluation, Summative Evaluation, Test Reliability
LaFlair, Geoffrey T.; Langenfeld, Thomas; Baig, Basim; Horie, André Kenji; Attali, Yigal; von Davier, Alina A. – Journal of Computer Assisted Learning, 2022
Background: Digital-first assessments leverage the affordances of technology in all elements of the assessment process--from design and development to score reporting and evaluation to create test taker-centric assessments. Objectives: The goal of this paper is to describe the engineering, machine learning, and psychometric processes and…
Descriptors: Computer Assisted Testing, Affordances, Scoring, Engineering
Iaccarino, Stephanie; von der Embse, Nathaniel; Kilgus, Stephen – Journal of Psychoeducational Assessment, 2019
Detecting mental illness in school students may prevent poor school outcomes. Clinicians often use universal behavioral screeners to identify students at risk for mental illness. This study examined the applicability of Kane's interpretation and use argument (IUA) to the Social, Academic, and Emotional Behavior Risk Screener--Teacher Rating Scale…
Descriptors: Screening Tests, Test Interpretation, Test Use, Mental Disorders
Patrick Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Report Series, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…
Descriptors: Assessment Literacy, Testing, Test Bias, Test Construction
Garcia, Allen G.; Lambert, Matthew C.; Epstein, Michael H.; Cullinan, Douglas – School Mental Health, 2019
The present study examined the measurement properties of the Emotional and Behavioral Screener (EBS), a universal screening instrument which identifies students presenting with emotional and behavioral problems. The primary research questions sought to examine the degree to which the EBS item responses fit the Rasch model through evaluating fit of…
Descriptors: Screening Tests, Identification, Behavior Rating Scales, Emotional Disturbances
Karren, Benjamin C. – Journal of Psychoeducational Assessment, 2017
The Gilliam Autism Rating Scale-Third Edition (GARS-3) is a norm-referenced tool designed to screen for autism spectrum disorders (ASD) in individuals between the ages of 3 and 22 (Gilliam, 2014). The GARS-3 test kit consists of three different components and includes an "Examiner's Manual," summary/response forms (50), and the…
Descriptors: Autism, Pervasive Developmental Disorders, Rating Scales, Norm Referenced Tests