Publication Date
| In 2026 | 8 |
| Since 2025 | 2276 |
| Since 2022 (last 5 years) | 12791 |
| Since 2017 (last 10 years) | 33916 |
| Since 2007 (last 20 years) | 68407 |
Descriptor
| Foreign Countries | 30560 |
| Test Validity | 21743 |
| Scores | 18256 |
| Academic Achievement | 16928 |
| Test Construction | 16756 |
| Test Reliability | 15028 |
| Achievement Tests | 14859 |
| Standardized Tests | 14720 |
| Comparative Analysis | 14431 |
| Elementary Secondary Education | 13042 |
| Language Tests | 12551 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5034 |
| Teachers | 3393 |
| Researchers | 2630 |
| Policymakers | 1232 |
| Administrators | 978 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2822 |
| Australia | 2426 |
| Canada | 2270 |
| California | 1854 |
| United States | 1726 |
| Texas | 1615 |
| China | 1578 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1202 |
| Germany | 1122 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Rebekah N. Lee – ProQuest LLC, 2022
The current study investigates Talk Aloud Problem Solving (TAPS) for master's level students at Endicott College in preparation for the Behavior Analysis Certification Board (BACB) exam. The exam format of multiple-choice questions was used as the basis for acquisition and utilization of a TAPS skill repertoire. The purpose was to add to the…
Descriptors: Protocol Analysis, Problem Solving, Masters Programs, Graduate Students
Ozge Ersan Cinar – ProQuest LLC, 2022
In educational tests, a group of questions related to a shared stimulus is called a testlet (e.g., a reading passage with multiple related questions). Use of testlets is very common in educational tests. Additionally, computerized adaptive testing (CAT) is a mode of testing where the test forms are created in real time tailoring to the test…
Descriptors: Test Items, Computer Assisted Testing, Adaptive Testing, Educational Testing
Sibiç, Okan; Akçay, Behiye; Arik, Merve – International Journal of Contemporary Educational Research, 2020
One of the diagnostic tests which is very valuable and used in education frequently is two-tier test. Two-tier tests are used for different purposes such as determining misconceptions, determining the comprehension level of students and etc.. Due to the wide and effective uses of two-tier tests, there are different studies in which two-tier tests…
Descriptors: Diagnostic Tests, Test Construction, Science Education, Educational Research
Using Differential Item Functioning to Test for Interrater Reliability in Constructed Response Items
Walker, Cindy M.; Göçer Sahin, Sakine – Educational and Psychological Measurement, 2020
The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared…
Descriptors: Test Bias, Interrater Reliability, Responses, Correlation
Pugh, Debra; De Champlain, André; Gierl, Mark; Lai, Hollis; Touchie, Claire – Research and Practice in Technology Enhanced Learning, 2020
The purpose of this study was to compare the quality of multiple choice questions (MCQs) developed using automated item generation (AIG) versus traditional methods, as judged by a panel of experts. The quality of MCQs developed using two methods (i.e., AIG or traditional) was evaluated by a panel of content experts in a blinded study. Participants…
Descriptors: Computer Assisted Testing, Test Construction, Multiple Choice Tests, Test Items
Clark, Amy K.; Karvonen, Meagan – Educational Assessment, 2020
Alternate assessments based on alternate achievement standards (AA-AAS) have historically lacked broad validity evidence and an overall evaluation of the extent to which evidence supports intended uses of results. An expanding body of validation literature, the funding of two AA-AAS consortia, and advances in computer-based assessment have…
Descriptors: Alternative Assessment, Test Validity, Test Use, Students with Disabilities
Deneen, Chris – Melbourne Centre for the Study of Higher Education, 2020
This document provides advice on moving from in-place, closed-book examinations to open-book, remote assessment. It offers conceptual considerations, practical tips, and discusses the relative merits of two common approaches to open-book exams: those with strict time limits, and those with broad time limits ('take-home' exams).
Descriptors: Tests, Test Format, Student Evaluation, Evaluation Methods
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…
Descriptors: Testing, Standards, Comparative Analysis, Guidelines
Lam, Joseph Hin Yan; Tong, Shelley Xiuli – Learning Environments Research, 2023
Despite the increasing use of virtual modalities in schools since the COVID-19 pandemic, no systematic tools exist to evaluate the process of online learning. We developed and validated an Online Learning Process Questionnaire (OLPQ) for assessing online at-home learning among 219 Hong Kong primary-school students and 474 caregivers. Exploratory…
Descriptors: Questionnaires, Test Construction, Test Validity, Electronic Learning
Al-Jarf, Reima – Online Submission, 2023
This study explores the similarities and differences between English and Arabic numeral-based formulaic expressions, and difficulties that student-translators have with them. A corpus of English and Arabic numeral-based formulaic expressions containing zero, two, three, twenty, sixty, hundred, thousand…etc., and another corpus of specialized…
Descriptors: Translation, Arabic, Contrastive Linguistics, Phrase Structure
Bäckström, Pontus – Educational Review, 2023
In the educational literature on peer effects, attention has been brought to the fact that the mechanisms creating peer effects are still to a large extent hidden in obscurity. The hypothesis in the study reported in this article was that the Frame Factor Theory (FFT) can be used to reveal such mechanisms. Using data from the Swedish TIMSS 2015 (N…
Descriptors: Peer Influence, Peer Relationship, Outcomes of Education, Factor Analysis
Thomas, Damon P. – Australian Educational Researcher, 2020
The rapid decline in Australian students' performance on the "National Assessment Program--Literacy and Numeracy" (NAPLAN) writing test is an issue of national concern. This paper presents the first investigation into patterns of achievement and progress on the NAPLAN writing test across the tested year levels (3, 5, 7 and 9) between…
Descriptors: Foreign Countries, National Competency Tests, Literacy, Writing Tests
Ezechukwu, Roseline Ifeoma; Chinecherem, Basil; Oguguo, E.; Ene, Catherine U.; Ugorji, Clifford O. – World Journal of Education, 2020
This study determined the psychometric properties of the Economics Achievement Test (EAT) using Item Response Theory (IRT). Two popular IRT models namely, one-parameter logistics (1PL) and two-parameter logistics (2PL) models were utilized. The researcher adopted instrumentation research design. Four research questions and two hypotheses were…
Descriptors: Economics Education, Economics, Achievement Tests, Psychometrics
Suciati; Munadi, Sudji; Sugiman; Febriyanti, Wiwin Dwi Ratna – European Journal of Educational Research, 2020
This study aims to design mathematical literacy instruments that have evidence of content and construct validity and are reliable for use as an assessment for learning. The research involved eight experts as instrument validators and 273 eighth-grade students of junior high school in Yogyakarta Province. The results showed that the ten…
Descriptors: Numeracy, Mathematics Tests, Test Construction, Test Validity
New York State Education Department, 2020
The instructions in this manual explain the responsibilities of school administrators for the New York State Testing Program (NYSTP) Grades 3-8 English Language Arts and Mathematics Tests. School administrators must be thoroughly familiar with the contents of the manual, and the policies and procedures must be followed as written so that testing…
Descriptors: Testing Programs, Mathematics Tests, Test Format, Computer Assisted Testing

Direct link
Peer reviewed
