Publication Date
In 2025 | 2 |
Since 2024 | 11 |
Since 2021 (last 5 years) | 58 |
Since 2016 (last 10 years) | 178 |
Since 2006 (last 20 years) | 459 |
Descriptor
Comparative Analysis | 1179 |
Test Validity | 1179 |
Test Reliability | 424 |
Foreign Countries | 245 |
Test Construction | 166 |
Correlation | 164 |
Scores | 157 |
Higher Education | 145 |
Statistical Analysis | 128 |
Psychometrics | 127 |
Language Tests | 121 |
More ▼ |
Source
Author
Linn, Robert L. | 5 |
Oakland, Thomas | 5 |
Baron-Cohen, Simon | 4 |
Fraser, Barry J. | 4 |
Matson, Johnny L. | 4 |
Silverstein, A. B. | 4 |
Allison, Carrie | 3 |
August, Diane | 3 |
Brown, James Dean | 3 |
Flippo, Rona F. | 3 |
Haladyna, Tom | 3 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 22 |
Practitioners | 19 |
Teachers | 6 |
Administrators | 5 |
Policymakers | 3 |
Counselors | 2 |
Parents | 1 |
Support Staff | 1 |
Location
Australia | 28 |
United States | 28 |
China | 23 |
Turkey | 22 |
Canada | 19 |
Israel | 10 |
Taiwan | 10 |
United Kingdom (England) | 10 |
Japan | 9 |
Netherlands | 9 |
Texas | 9 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Aid to Families with… | 1 |
Elementary and Secondary… | 1 |
Rehabilitation Act 1973… | 1 |
Workforce Investment Act 1998… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
R. Lanai Jennings; Megan Midkiff; Emily Nestor McCauley; Jeremy Lopuch; Sandra Stroebel; Rachel James; Mary Toler; Rebecca Wendell; Paula King; Mallory Frampton – Contemporary School Psychology, 2024
Reading comprehension is one of the most valuable academic skills taught in school. Selecting the appropriate assessment instrument to ensure early identification and intervention is important as there is an amalgam of cognitive abilities and academic skills involved in reading comprehension. The GORT-5 is the most recent edition of a test that…
Descriptors: Test Validity, Diagnostic Tests, Reading Comprehension, Early Intervention
Yangqiuting Li; Chandralekha Singh – Physical Review Physics Education Research, 2025
Research-based multiple-choice questions implemented in class with peer instruction have been shown to be an effective tool for improving students' engagement and learning outcomes. Moreover, multiple-choice questions that are carefully sequenced to build on each other can be particularly helpful for students to develop a systematic understanding…
Descriptors: Physics, Science Instruction, Science Tests, Multiple Choice Tests
Katrin Klingbeil; Fabian Rösken; Bärbel Barzel; Florian Schacht; Kaye Stacey; Vicki Steinle; Daniel Thurm – ZDM: Mathematics Education, 2024
Assessing students' (mis)conceptions is a challenging task for teachers as well as for researchers. While individual assessment, for example through interviews, can provide deep insights into students' thinking, this is very time-consuming and therefore not feasible for whole classes or even larger settings. For those settings, automatically…
Descriptors: Multiple Choice Tests, Formative Evaluation, Mathematics Tests, Misconceptions
Kate E. Walton; Cristina Anguiano-Carrasco – ACT, Inc., 2024
Large language models (LLMs), such as ChatGPT, are becoming increasingly prominent. Their use is becoming more and more popular to assist with simple tasks, such as summarizing documents, translating languages, rephrasing sentences, or answering questions. Reports like McKinsey's (Chui, & Yee, 2023) estimate that by implementing LLMs,…
Descriptors: Artificial Intelligence, Man Machine Systems, Natural Language Processing, Test Construction
Shear, Benjamin R. – Journal of Educational Measurement, 2023
Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…
Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests
Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023
This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…
Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions
David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023
We compared the influence of open-book extended duration versus closed book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30xfree-response short answer to a question, SAQ) and end-of-year paper (4xSAQ,…
Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format
Jane Batamuliza; Gonzague Habinshuti; Jean Baptiste Nkurunziza – Journal of Technology and Science Education, 2024
This current study presents the effects of interactive computer simulations on students' performance and concept retention in the unit of chemical reactions. Purposive sampling was used to select four schools with a sample population of 320. The Achievement test on chemical reactions was developed, validated, and checked for reliability. The…
Descriptors: Chemistry, Science Instruction, Teaching Methods, Comparative Analysis
Leda Lampropoulou – Language Education & Assessment, 2023
Extensive oral tasks or monologues of different types (e.g., presentations, storytelling) are often used as second language acquisition tasks in the fields of language learning and language testing. Pre-task planning time is a common provision to test-takers who may use different strategies to prepare their response. High-stakes tests, such as the…
Descriptors: Language Tests, Speech Communication, Test Validity, Culture Fair Tests
Uminski, Crystal; Hubbard, Joanna K.; Couch, Brian A. – CBE - Life Sciences Education, 2023
Biology instructors use concept assessments in their courses to gauge student understanding of important disciplinary ideas. Instructors can choose to administer concept assessments based on participation (i.e., lower stakes) or the correctness of responses (i.e., higher stakes), and students can complete the assessment in an in-class or…
Descriptors: Biology, Science Tests, High Stakes Tests, Scores
Ute Knoch; Jason Fan – Language Testing, 2024
While several test concordance tables have been published, the research underpinning such tables has rarely been examined in detail. This study aimed to survey the publically available studies or documentation underpinning the test concordance tables of the providers of four major international language tests, all accepted by the Australian…
Descriptors: Language Tests, English, Test Validity, Item Analysis
Ibrahim Abba Mohammed; Ahmed Bello – Pedagogical Research, 2024
Due to rapid technological advancement that continues to permeate almost all facets of the education sector, video learning has been explored to enhance performance, but most researchers do not incorporate flipped classroom in mathematical videos, which affects the teaching and learning of the subject in Nigeria. This study checked the…
Descriptors: Flipped Classroom, Mathematics Tests, Achievement Tests, Private Schools
Matt I. Brown; Patrick R. Heck; Christopher F. Chabris – Journal of Autism and Developmental Disorders, 2024
The Social Shapes Test (SST) is a measure of social intelligence which does not use human faces or rely on extensive verbal ability. The SST has shown promising validity among adults without autism spectrum disorder (ASD), but it is uncertain whether it is suitable for adults with ASD. We find measurement invariance between adults with (n = 229)…
Descriptors: Interpersonal Competence, Autism Spectrum Disorders, Emotional Intelligence, Verbal Ability
Maïano, Christophe; Morin, Alexandre J. S.; Tietjens, Maike; Bastos, Tânia; Luiggi, Maxime; Corredeira, Rui; Griffet, Jean; Sánchez-Oliva, David – Measurement in Physical Education and Exercise Science, 2023
The present study sought to examine the psychometric properties of new German, Portuguese, and Spanish versions of the Revised Short Form of the Physical Self-Inventory (PSI-S-"R"), and to contrast these properties against those from the original French version of this instrument. Participants (n = 1802) were 288 French youth, 177 German…
Descriptors: German, Portuguese, Spanish, Test Construction
von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023
Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…
Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education