Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 8 |
Since 2016 (last 10 years) | 21 |
Since 2006 (last 20 years) | 82 |
Descriptor
Source
Author
Clariana, Roy B. | 2 |
Darling-Hammond, Linda | 2 |
Torney-Purta, Judith | 2 |
Abedi, Jamal | 1 |
Alonso Rivas, Javier | 1 |
Anbar, Michael | 1 |
Arias, Beatriz | 1 |
Arnett, Patricia L. | 1 |
Arnò, Simone | 1 |
Bacon, Donald R. | 1 |
Baird, Matthew | 1 |
More ▼ |
Publication Type
Education Level
Location
Australia | 8 |
United States | 7 |
United Kingdom | 5 |
United Kingdom (England) | 5 |
Connecticut | 4 |
New York | 4 |
North Carolina | 4 |
Canada | 3 |
Hong Kong | 2 |
Kansas | 2 |
Malaysia | 2 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Every Student Succeeds Act… | 2 |
Elementary and Secondary… | 1 |
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Ioana-Elena Oana; Carsten Q. Schneider – Sociological Methods & Research, 2024
The robustness of qualitative comparative analysis (QCA) results features high on the agenda of methodologists and practitioners. This article aims at advancing this debate on several fronts. First, in line with the extant literature, we take a comprehensive view on robustness arguing that decisions on calibration, consistency, and frequency…
Descriptors: Robustness (Statistics), Qualitative Research, Comparative Analysis, Decision Making
Hertog, Steffen – Sociological Methods & Research, 2023
In mixed methods approaches, statistical models are used to identify "nested" cases for intensive, small-n investigation for a range of purposes, including notably the examination of causal mechanisms. This article shows that under a commonsense interpretation of causal effects, large-n models allow no reliable conclusions about effect…
Descriptors: Case Studies, Generalization, Prediction, Mixed Methods Research
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022
Educational assessments often require uniform test forms, for which each test form has equivalent measurement accuracy but with a different set of items. For uniform test assembly, an important issue is the increase of the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…
Descriptors: Simulation, Efficiency, Test Items, Educational Assessment
Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022
As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…
Descriptors: Scores, Scoring, Comparative Analysis, Testing
National Assessment of Educational Progress (NAEP), 2025
Also known as The Nation's Report Card, the National Assessment of Educational Progress (NAEP) is the largest nationally representative and continuing assessment of student achievement and their learning experiences in various subjects for the nation, states, and 27 urban districts. The National Center for Education Statistics (NCES) is currently…
Descriptors: National Competency Tests, Innovation, Futures (of Society), Achievement Tests
Arnò, Simone; Galassi, Alessandra; Tommasi, Marco; Saggino, Aristide; Vittorini, Pierpaolo – International Journal of Distance Education Technologies, 2021
Online proctoring generally refers to the practice of proctors monitoring an exam over the internet, usually through a webcam. This technology has gained relevance during the current COVID-19 pandemic, given that the social distance owing to health reasons has consequently led to the switching of all learning and assessment activities to online…
Descriptors: Supervision, Computer Assisted Testing, Electronic Learning, Educational Technology
Li, Jie; van der Linden, Wim J. – Journal of Educational Measurement, 2018
The final step of the typical process of developing educational and psychological tests is to place the selected test items in a formatted form. The step involves the grouping and ordering of the items to meet a variety of formatting constraints. As this activity tends to be time-intensive, the use of mixed-integer programming (MIP) has been…
Descriptors: Programming, Automation, Test Items, Test Format
Sekeres, Melanie J.; Moscovitch, Morris; Grady, Cheryl L.; Sullens, D. Gregory; Winocur, Gordon – Learning & Memory, 2020
Conditioned fear memories that are context-specific shortly after conditioning generalize over time. We exposed rats to a context reminder 30 d after conditioning, which served to reinstate context-specificity, and investigated how this reminder alters retrieval-induced activity in the hippocampus and anterior cingulate cortex (aCC) relative to a…
Descriptors: Memory, Animals, Brain Hemisphere Functions, Conditioning
DePascale, Charlie; Dadey, Nathan; Lyons, Susan – Council of Chief State School Officers, 2016
The purpose of this document is to provide information and advice to support states in meeting USED Peer Review Requirements related to demonstrating the comparability of test scores across various devices used for technology-based testing. The document is divided into four main sections. In the first section, we discuss and define the…
Descriptors: Computer Assisted Testing, Scores, Intermode Differences, Influence of Technology
Moothedath, Shana; Chaporkar, Prasanna; Belur, Madhu N. – Perspectives in Education, 2016
In recent years, the computerised adaptive test (CAT) has gained popularity over conventional exams in evaluating student capabilities with desired accuracy. However, the key limitation of CAT is that it requires a large pool of pre-calibrated questions. In the absence of such a pre-calibrated question bank, offline exams with uncalibrated…
Descriptors: Guessing (Tests), Computer Assisted Testing, Adaptive Testing, Maximum Likelihood Statistics
Heritage, Margaret; Kingston, Neal M. – Journal of Educational Measurement, 2019
Classroom assessment and large-scale assessment have, for the most part, existed in mutual isolation. Some experts have felt this is for the best and others have been concerned that the schism limits the potential contribution of both forms of assessment. Margaret Heritage has long been a champion of best practices in classroom assessment. Neal…
Descriptors: Measurement, Psychometrics, Context Effect, Classroom Environment
von Davier, Matthias; Khorramdel, Lale; He, Qiwei; Shin, Hyo Jeong; Chen, Haiwen – Journal of Educational and Behavioral Statistics, 2019
International large-scale assessments (ILSAs) transitioned from paper-based assessments to computer-based assessments (CBAs) facilitating the use of new item types and more effective data collection tools. This allows implementation of more complex test designs and to collect process and response time (RT) data. These new data types can be used to…
Descriptors: International Assessment, Computer Assisted Testing, Psychometrics, Item Response Theory
Papageorgiou, Spiros; Manna, Venessa F. – Language Assessment Quarterly, 2021
The TOEFL iBT test was introduced in 2005 to better reflect the language demands of real-life academic tasks than did previous versions of the test. The task-based design of the test was intended to support the interpretation of its scores as a trustworthy measure of international students' ability to use English in an academic environment. Until…
Descriptors: Academic Language, COVID-19, Pandemics, Scores
Ghaderi, Marzieh; Mogholi, Marzieh; Soori, Afshin – International Journal of Education and Literacy Studies, 2014
Testing subject has many subsets and connections. One important issue is how to assess or measure students or learners. What would be our tools, what would be our style, what would be our goal and so on. So in this paper the author attended to the style of testing in school and other educational settings. Since the purposes of educational system…
Descriptors: Testing, Testing Programs, Intermode Differences, Computer Assisted Testing