Publication Date
In 2025 | 1 |
Since 2024 | 6 |
Since 2021 (last 5 years) | 28 |
Since 2016 (last 10 years) | 85 |
Since 2006 (last 20 years) | 175 |
Descriptor
Scoring | 402 |
Test Reliability | 402 |
Test Validity | 402 |
Test Construction | 153 |
Testing | 104 |
Test Items | 67 |
Test Interpretation | 60 |
Psychometrics | 55 |
Item Analysis | 49 |
Language Tests | 46 |
Measurement Techniques | 42 |
More ▼ |
Source
Author
McCrimmon, Adam W. | 6 |
Stansfield, Charles W. | 4 |
Breland, Hunter M. | 3 |
Frary, Robert B. | 3 |
Guthrie, P. D. | 3 |
Hambleton, Ronald K. | 3 |
Paek, Insu | 3 |
Schoen, Robert C. | 3 |
Yang, Xiaotong | 3 |
Anna-Maria Fall | 2 |
Bae, Yunhee | 2 |
More ▼ |
Publication Type
Education Level
Audience
Practitioners | 22 |
Researchers | 11 |
Administrators | 8 |
Teachers | 8 |
Policymakers | 5 |
Students | 3 |
Counselors | 1 |
Parents | 1 |
Location
New York | 13 |
Canada | 7 |
Nebraska | 7 |
Pennsylvania | 5 |
Turkey | 5 |
Australia | 4 |
United States | 4 |
New Mexico | 3 |
Texas | 3 |
United Kingdom (England) | 3 |
California | 2 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 5 |
No Child Left Behind Act 2001 | 2 |
Education Consolidation… | 1 |
Individuals with Disabilities… | 1 |
Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Marcos Jiménez; María Zapata-Cáceres; Marcos Román-González; Gregorio Robles; Jesús Moreno-León; Estefanía Martín-Barroso – Journal of Science Education and Technology, 2024
Computational thinking (CT) is a multidimensional term that encompasses a wide variety of problem-solving skills related to the field of computer science. Unfortunately, standardized, valid, and reliable methods to assess CT skills in preschool children are lacking, compromising the reliability of the results reported in CT interventions. To…
Descriptors: Computation, Thinking Skills, Student Evaluation, Preschool Children
Susan K. Johnsen – Gifted Child Today, 2024
The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…
Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity
Reuben S. Asempapa; Doris Lee – Discover Education, 2025
Across the world, standards and practices for preparing teachers of mathematics emphasize the importance of math modeling (MM) in developing students' mathematical thinking. The aim of this research study was to develop the Mathematical Modeling Knowledge Scale (MAMKS), capable of determining preservice teachers' (PSTs') knowledge of MM. The study…
Descriptors: Preservice Teachers, Preservice Teacher Education, Mathematics Education, Mathematics Curriculum
Alatli, Betül – International Journal of Curriculum and Instruction, 2022
This study was conducted to review the use of tests. For this purpose, 45 articles in which the Turkish form of the "Test Anxiety Inventory (TAI)," which is one of the tests frequently used in the field of education, was employed and that were published between 2000 and 2020 were examined in terms of factors that should be considered in…
Descriptors: Anxiety, Likert Scales, Test Anxiety, Test Reliability
Fergadiotis, Gerasimos; Casilio, Marianne; Dickey, Michael Walsh; Steel, Stacey; Nicholson, Hannele; Fleegle, Mikala; Swiderski, Alexander; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2023
Purpose: Item response theory (IRT) is a modern psychometric framework with several advantageous properties as compared with classical test theory. IRT has been successfully used to model performance on anomia tests in individuals with aphasia; however, all efforts to date have focused on noun production accuracy. The purpose of this study is to…
Descriptors: Item Response Theory, Psychometrics, Verbs, Naming
Sachin Nedungadi; Corina E. Brown; Sue Hyeon Paek – Journal of Chemical Education, 2022
The Fundamental Concepts for Organic Reaction Mechanisms Inventory (FC-ORMI) is a concept inventory with most items in a two-tier design in which an answer tier is followed by a reasoning tier. Statistical results provided strong evidence for the validity and reliability of the data obtained using the FC-ORMI. In this study, differential item…
Descriptors: Test Bias, Test Validity, Test Reliability, Gender Differences
Williams, Zachary J.; Cascio, Carissa J.; Woynaroski, Tiffany G. – Autism: The International Journal of Research and Practice, 2023
Quality of life is widely acknowledged as one of the most important outcomes in autism research, but few measures of this construct have been validated for use in autistic people. The goal of the current study was to examine the psychometric properties of the Patient-Reported Outcomes Measurement Information System Global--10, an established…
Descriptors: Quality of Life, Autism Spectrum Disorders, Adults, Psychometrics
Gayle Geschwind; Michael Vignal; Marcos D. Caballero; H.? J. Lewandowski – Physical Review Physics Education Research, 2024
The Survey of Physics Reasoning on Uncertainty Concepts in Experiments (SPRUCE) was designed to measure students' proficiency with measurement uncertainty concepts and practices across ten different assessment objectives to help facilitate the improvement of laboratory instruction focused on this important topic. To ensure the reliability and…
Descriptors: Measurement, Ambiguity (Context), Scientific Concepts, Physics
Lenz, A. Stephen; Ault, Haley; Balkin, Richard S.; Barrio Minton, Casey; Erford, Bradley T.; Hays, Danica G.; Kim, Bryan S. K.; Li, Chi – Measurement and Evaluation in Counseling and Development, 2022
In April 2021, The Association for Assessment and Research in Counseling Executive Council commissioned a time-referenced task group to revise the Responsibilities of Users of Standardized Tests (RUST) Statement (3rd edition) published by the Association for Assessment in Counseling (AAC) in 2003. The task group developed a work plan to implement…
Descriptors: Responsibility, Standardized Tests, Counselor Training, Ethics
Johnson, Jennifer M.; Meisinger, Elizabeth B.; Robinson, Melissa F. – Journal of Psychoeducational Assessment, 2019
This review focuses on the Feifer Assessment of Reading (FAR) produced by S. G. Feifer and R. G. Nader in 2015. The FAR is a comprehensive reading test that is individually administered to children and adults aged 4 to 21 years. The structure of the FAR is based on a gradient model of brain functioning (Goldberg, 1990; Luria, 1980) and reflects a…
Descriptors: Reading Tests, Scoring, Test Construction, Test Norms
Güntay Tasçi – Science Insights Education Frontiers, 2024
The present study has aimed to develop and validate a protein concept inventory (PCI) consisting of 25 multiple-choice (MC) questions to assess students' understanding of protein, which is a fundamental concept across different biology disciplines. The development process of the PCI involved a literature review to identify protein-related content,…
Descriptors: Science Instruction, Science Tests, Multiple Choice Tests, Biology
Latifi, Syed; Gierl, Mark – Language Testing, 2021
An automated essay scoring (AES) program is a software system that uses techniques from corpus and computational linguistics and machine learning to grade essays. In this study, we aimed to describe and evaluate particular language features of Coh-Metrix for a novel AES program that would score junior and senior high school students' essays from…
Descriptors: Writing Evaluation, Computer Assisted Testing, Scoring, Essays
Guo, Hongwen; Ling, Guangming; Frankel, Lois – ETS Research Report Series, 2020
With advances in technology, researchers and test developers are developing new item types to measure complex skills like problem solving and critical thinking. Analyzing such items is often challenging because of their complicated response patterns, and thus it is important to develop psychometric methods for practitioners and researchers to…
Descriptors: Test Construction, Test Items, Item Analysis, Psychometrics
Lynsey Joohyun Lee – ProQuest LLC, 2021
Reliability and validity are two important topics that have been studied for many decades in the educational measurement field, including discussions of Writing Studies' subfield of writing assessment, since the establishment of the College Entrance Exam Board [CEEB] in 1899 (Huot et al., 2010). In recent years, scholarly conversations of fairness…
Descriptors: Writing Evaluation, Test Validity, Test Reliability, Case Studies
Hendrickson, Nicholas K.; McCrimmon, Adam W. – Canadian Journal of School Psychology, 2019
This article describes and reviews the "Behavior Rating Inventory of Executive Function, Second Edition" (BRIEF2; Gioia, Isquith, Guy, & Kenworthy, 2015). Published by PARInc., it is an updated individually administered rating scale of executive function (EF) for children and youth, aged 5 to 18 years. Primarily used in clinical,…
Descriptors: Behavior Rating Scales, Executive Function, Child Behavior, Adolescents