Venessa F. Manna; Shuhong Li; Spiros Papageorgiou; Lixiong Gu – ETS Research Report Series, 2025
This technical manual describes the purpose and intended uses of the TOEFL iBT test, its target test-taker population, and relevant language use domains. The test design and scoring procedures are presented first, followed by a research agenda intended to support the interpretation and use of test scores. Given the updates to the test starting…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Test Construction
Susan K. Johnsen – Gifted Child Today, 2024
The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…
Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity
Han, Chao – Language Testing, 2022
Over the past decade, testing and assessing spoken-language interpreting has garnered an increasing amount of attention from stakeholders in interpreter education, professional certification, and interpreting research. This is because in these fields assessment results provide a critical evidential basis for high-stakes decisions, such as the…
Descriptors: Translation, Language Tests, Testing, Evaluation Methods
Jeff Allen; Ty Cruce – ACT Education Corp., 2025
This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…
Descriptors: College Entrance Examinations, Testing, Change, Scores
Hendrickson, Nicholas K.; McCrimmon, Adam W. – Canadian Journal of School Psychology, 2019
This article describes and reviews the "Behavior Rating Inventory of Executive Function, Second Edition" (BRIEF2; Gioia, Isquith, Guy, & Kenworthy, 2015). Published by PAR, Inc., it is an updated individually administered rating scale of executive function (EF) for children and youth, aged 5 to 18 years. Primarily used in clinical,…
Descriptors: Behavior Rating Scales, Executive Function, Child Behavior, Adolescents
Mattern, Krista; Radunzel, Justine – ACT, Inc., 2019
When applicants take the ACT® more than once, how do colleges and universities reconcile and make sense of the multiple scores? In terms of validity, fairness, and impact on subgroup differences, are certain score-use policies better than others? The focus of this issue brief is to summarize evidence on the validity and fairness of various…
Descriptors: Scoring, College Entrance Examinations, Test Validity, Evaluation Methods
Nebraska Department of Education, 2018
The 2018 Nebraska Student-Centered Assessment System (NSCAS) Summative technical report documents the processes and procedures implemented to support the Spring 2018 NSCAS Summative English Language Arts (ELA), Mathematics, and Science assessments by NWEA under the supervision of the Nebraska Department of Education (NDE). The technical report…
Descriptors: Summative Evaluation, Language Tests, English, Mathematics Tests
Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022
In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…
Descriptors: Computer Assisted Testing, Tests, Scores, Scoring
International Journal of Testing, 2018
The second edition of the International Test Commission Guidelines for Translating and Adapting Tests was prepared between 2005 and 2015 to improve upon the first edition, and to respond to advances in testing technology and practices. The 18 guidelines are organized into six categories to facilitate their use: pre-condition (3), test development…
Descriptors: Translation, Test Construction, Testing, Scoring
Developing a High Performance Digital Education Ecosystem: Institutional Self-Assessment Instruments
Volungeviciene, Airina; Brown, Mark; Greenspon, Rasa; Gaebel, Michael; Morrisroe, Alison – European University Association, 2021
Digitally enhanced learning and teaching is widely used across the European Higher Education Area, with general acceptance growing over the years and institutions widely acknowledging the benefits it brings to the student experience. The strategic focus being placed on digitally enhanced learning and teaching has increased, undoubtedly accelerated…
Descriptors: Educational Technology, Technology Uses in Education, Program Evaluation, Self Evaluation (Groups)
Fairbairn, Judith; Spiby, Richard – European Journal of Special Needs Education, 2019
Language test developers have a responsibility to ensure that their tests are accessible to test takers of various backgrounds and characteristics and also that they have the opportunity to perform to the best of their ability. This principle is widely recognised by educational and language testing associations in guidelines for the production and…
Descriptors: Testing, Language Tests, Test Construction, Testing Accommodations
Collier, Jo-Kate; Huang, Becky – Language Assessment Quarterly, 2020
This article presents a critical review of the Texas English Language Proficiency Assessment System (TELPAS), a large scale standardized English language proficiency (ELP) assessment developed by the Texas Education Agency (TEA) and administered since 2004. TELPAS is used as an annual summative assessment for all English Learners (ELs) in grades…
Descriptors: English (Second Language), Language Proficiency, Language Tests, Standardized Tests
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Karren, Benjamin C. – Journal of Psychoeducational Assessment, 2017
The Gilliam Autism Rating Scale-Third Edition (GARS-3) is a norm-referenced tool designed to screen for autism spectrum disorders (ASD) in individuals between the ages of 3 and 22 (Gilliam, 2014). The GARS-3 test kit consists of three different components and includes an "Examiner's Manual," summary/response forms (50), and the…
Descriptors: Autism, Pervasive Developmental Disorders, Rating Scales, Norm Referenced Tests
Vera Frith; Robert N. Prince – Numeracy, 2018
The National Benchmark Test Project (NBTP) was commissioned by Higher Education South Africa in 2005 to assess the academic proficiency of prospective students. The competencies assessed include quantitative literacy using the NBTP QL test. This instrument is a criterion-referenced multiple-choice test developed collaboratively by South African…
Descriptors: National Competency Tests, Numeracy, Mathematics Tests, Foreign Countries

