ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	21
Since 2006 (last 20 years)	82

Descriptor

Comparative Analysis	169
Testing	49
Computer Assisted Testing	37
Academic Achievement	34
Foreign Countries	33
Evaluation Methods	27
Hypothesis Testing	26
Elementary Secondary Education	23
Higher Education	23
Educational Assessment	22
Testing Programs	21
Educational Testing	19
Test Items	19
Test Results	18
Comparative Testing	17
Test Construction	17
Scores	16
Standardized Tests	14
Student Evaluation	14
Scoring	13
Testing Problems	13
Achievement Tests	12
Research Methodology	12
School Districts	12
State Programs	12
More ▼

Publication Type

Reports - Descriptive	169
Journal Articles	106
Numerical/Quantitative Data	13
Speeches/Meeting Papers	12
Reports - Research	7
Reports - Evaluative	5
Information Analyses	4
Historical Materials	3
Collected Works - Serials	2
Opinion Papers	2
Books	1
Collected Works - General	1
Guides - General	1
Guides - Non-Classroom	1
Legal/Legislative/Regulatory…	1
Tests/Questionnaires	1
More ▼

Education Level

Elementary Secondary Education	26
Higher Education	17
Secondary Education	10
High Schools	8
Elementary Education	6
Postsecondary Education	6
Grade 4	4
Adult Education	3
Grade 8	3
Intermediate Grades	3
Junior High Schools	3
Middle Schools	3
Grade 12	1
Grade 5	1
High School Equivalency…	1
More ▼

Audience

Policymakers	6
Researchers	5
Practitioners	3
Teachers	3
Administrators	1
Community	1
Parents	1
Students	1

Location

Australia	8
United States	7
United Kingdom	5
United Kingdom (England)	5
Connecticut	4
New York	4
North Carolina	4
Canada	3
Hong Kong	2
Kansas	2
Malaysia	2
Mexico	2
New Hampshire	2
Rhode Island	2
Vermont	2
Afghanistan	1
Bangladesh	1
Bhutan	1
Botswana	1
California	1
Cambodia	1
Chile	1
Colombia	1
Czech Republic	1
Denmark	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	3
Every Student Succeeds Act…	2
Elementary and Secondary…	1
Elementary and Secondary…	1

Assessments and Surveys

National Assessment of…	12
Program for International…	9
Trends in International…	4
Comprehensive Tests of Basic…	3
New York State Regents…	3
Progress in International…	3
California Achievement Tests	2
Iowa Tests of Basic Skills	2
North Carolina End of Course…	2
Test of English as a Foreign…	2
General Educational…	1
Graduate Record Examinations	1
Measures of Academic Progress	1
Metropolitan Achievement Tests	1
Stanford Achievement Tests	1
Test of Economic Literacy	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 169 results Save | Export

A Robustness Test Protocol for Applied QCA: Theory and R Software Application

Peer reviewed

Direct link

Ioana-Elena Oana; Carsten Q. Schneider – Sociological Methods & Research, 2024

The robustness of qualitative comparative analysis (QCA) results features high on the agenda of methodologists and practitioners. This article aims at advancing this debate on several fronts. First, in line with the extant literature, we take a comprehensive view on robustness arguing that decisions on calibration, consistency, and frequency…

Descriptors: Robustness (Statistics), Qualitative Research, Comparative Analysis, Decision Making

Taking Causal Heterogeneity Seriously: Implications for Case Choice and Case Study-Based Generalizations

Peer reviewed

Direct link

Hertog, Steffen – Sociological Methods & Research, 2023

In mixed methods approaches, statistical models are used to identify "nested" cases for intensive, small-n investigation for a range of purposes, including notably the examination of causal mechanisms. This article shows that under a commonsense interpretation of causal effects, large-n models allow no reliable conclusions about effect…

Descriptors: Case Studies, Generalization, Prediction, Mixed Methods Research

Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing

Peer reviewed

Direct link

Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022

While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…

Descriptors: Scoring, Testing, Test Items, Test Format

Hybrid Maximum Clique Algorithm Using Parallel Integer Programming for Uniform Test Assembly

Peer reviewed

Direct link

Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022

Educational assessments often require uniform test forms, for which each test form has equivalent measurement accuracy but with a different set of items. For uniform test assembly, an important issue is the increase of the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…

Descriptors: Simulation, Efficiency, Test Items, Educational Assessment

Score Comparability Issues with At-Home Testing and How to Address Them

Peer reviewed

Direct link

Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022

As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…

Descriptors: Scores, Scoring, Comparative Analysis, Testing

NAEP 2025 Field Test: FAQs about NAEP Modernizations

Peer reviewed
PDF on ERIC

Download full text

National Assessment of Educational Progress (NAEP), 2025

Also known as The Nation's Report Card, the National Assessment of Educational Progress (NAEP) is the largest nationally representative and continuing assessment of student achievement and their learning experiences in various subjects for the nation, states, and 27 urban districts. The National Center for Education Statistics (NCES) is currently…

Descriptors: National Competency Tests, Innovation, Futures (of Society), Achievement Tests

State-of-the-Art of Commercial Proctoring Systems and Their Use in Academic Online Exams

Peer reviewed

Direct link

Arnò, Simone; Galassi, Alessandra; Tommasi, Marco; Saggino, Aristide; Vittorini, Pierpaolo – International Journal of Distance Education Technologies, 2021

Online proctoring generally refers to the practice of proctors monitoring an exam over the internet, usually through a webcam. This technology has gained relevance during the current COVID-19 pandemic, given that the social distance owing to health reasons has consequently led to the switching of all learning and assessment activities to online…

Descriptors: Supervision, Computer Assisted Testing, Electronic Learning, Educational Technology

A Comparison of Constraint Programming and Mixed-Integer Programming for Automated Test-Form Generation

Peer reviewed

Direct link

Li, Jie; van der Linden, Wim J. – Journal of Educational Measurement, 2018

The final step of the typical process of developing educational and psychological tests is to place the selected test items in a formatted form. The step involves the grouping and ordering of the items to meet a variety of formatting constraints. As this activity tends to be time-intensive, the use of mixed-integer programming (MIP) has been…

Descriptors: Programming, Automation, Test Items, Test Format

Reminders Reinstate Context-Specificity to Generalized Remote Memories in Rats: Relation to Activity in the Hippocampus and aCC

Peer reviewed

Direct link

Sekeres, Melanie J.; Moscovitch, Morris; Grady, Cheryl L.; Sullens, D. Gregory; Winocur, Gordon – Learning & Memory, 2020

Conditioned fear memories that are context-specific shortly after conditioning generalize over time. We exposed rats to a context reminder 30 d after conditioning, which served to reinstate context-specificity, and investigated how this reminder alters retrieval-induced activity in the hippocampus and anterior cingulate cortex (aCC) relative to a…

Descriptors: Memory, Animals, Brain Hemisphere Functions, Conditioning

Score Comparability across Computerized Assessment Delivery Devices: Defining Comparability, Reviewing the Literature, and Providing Recommendations for States When Submitting to Title 1 Peer Review

Download full text

DePascale, Charlie; Dadey, Nathan; Lyons, Susan – Council of Chief State School Officers, 2016

The purpose of this document is to provide information and advice to support states in meeting USED Peer Review Requirements related to demonstrating the comparability of test scores across various devices used for technology-based testing. The document is divided into four main sections. In the first section, we discuss and define the…

Descriptors: Computer Assisted Testing, Scores, Intermode Differences, Influence of Technology

A Maximum Likelihood Based Offline Estimation of Student Capabilities and Question Difficulties with Guessing

Peer reviewed

Direct link

Moothedath, Shana; Chaporkar, Prasanna; Belur, Madhu N. – Perspectives in Education, 2016

In recent years, the computerised adaptive test (CAT) has gained popularity over conventional exams in evaluating student capabilities with desired accuracy. However, the key limitation of CAT is that it requires a large pool of pre-calibrated questions. In the absence of such a pre-calibrated question bank, offline exams with uncalibrated…

Descriptors: Guessing (Tests), Computer Assisted Testing, Adaptive Testing, Maximum Likelihood Statistics

Classroom Assessment and Large-Scale Psychometrics: Shall the Twain Meet? (A Conversation with Margaret Heritage and Neal Kingston)

Peer reviewed

Direct link

Heritage, Margaret; Kingston, Neal M. – Journal of Educational Measurement, 2019

Classroom assessment and large-scale assessment have, for the most part, existed in mutual isolation. Some experts have felt this is for the best and others have been concerned that the schism limits the potential contribution of both forms of assessment. Margaret Heritage has long been a champion of best practices in classroom assessment. Neal…

Descriptors: Measurement, Psychometrics, Context Effect, Classroom Environment

Developments in Psychometric Population Models for Technology-Based Large-Scale Assessments: An Overview of Challenges and Opportunities

Peer reviewed

Direct link

von Davier, Matthias; Khorramdel, Lale; He, Qiwei; Shin, Hyo Jeong; Chen, Haiwen – Journal of Educational and Behavioral Statistics, 2019

International large-scale assessments (ILSAs) transitioned from paper-based assessments to computer-based assessments (CBAs) facilitating the use of new item types and more effective data collection tools. This allows implementation of more complex test designs and to collect process and response time (RT) data. These new data types can be used to…

Descriptors: International Assessment, Computer Assisted Testing, Psychometrics, Item Response Theory

Maintaining Access to a Large-Scale Test of Academic Language Proficiency during the Pandemic: The Launch of TOEFL iBT Home Edition

Peer reviewed

Direct link

Papageorgiou, Spiros; Manna, Venessa F. – Language Assessment Quarterly, 2021

The TOEFL iBT test was introduced in 2005 to better reflect the language demands of real-life academic tasks than did previous versions of the test. The task-based design of the test was intended to support the interpretation of its scores as a trustworthy measure of international students' ability to use English in an academic environment. Until…

Descriptors: Academic Language, COVID-19, Pandemics, Scores

Comparing between Computer Based Tests and Paper-and-Pencil Based Tests

Peer reviewed
PDF on ERIC

Download full text

Ghaderi, Marzieh; Mogholi, Marzieh; Soori, Afshin – International Journal of Education and Literacy Studies, 2014

Testing subject has many subsets and connections. One important issue is how to assess or measure students or learners. What would be our tools, what would be our style, what would be our goal and so on. So in this paper the author attended to the style of testing in school and other educational settings. Since the purposes of educational system…

Descriptors: Testing, Testing Programs, Intermode Differences, Computer Assisted Testing

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12

Journal of Educational…	4
International Journal of…	3
Journal of Educational…	3
Journal of Research on…	3
Council of Chief State School…	2
Educational Assessment	2
Educational Researcher	2
Educational and Psychological…	2
IEEE Transactions on Learning…	2
Internet Research	2
Journal of Educational and…	2
Language Assessment Quarterly	2
Language Learning	2
Proceedings of the ASIS…	2
Research Papers in Education	2
Sociological Methods &…	2
Teaching of Psychology	2
ACT, Inc.	1
Academic Medicine	1
Advances in Physiology…	1
American Biology Teacher	1
American Institutes for…	1
Applied Linguistics	1
Australian Council for…	1
Australian Senior Mathematics…	1
More ▼

Clariana, Roy B.	2
Darling-Hammond, Linda	2
Torney-Purta, Judith	2
Abedi, Jamal	1
Alonso Rivas, Javier	1
Anbar, Michael	1
Arias, Beatriz	1
Arnett, Patricia L.	1
Arnò, Simone	1
Bacon, Donald R.	1
Baird, Matthew	1
Baldwin, Peter	1
Banks, Kathleen	1
Baron, Joan Boykoff	1
Bartram, Dave	1
Belur, Madhu N.	1
Benderson, Albert, Ed.	1
Beretvas, S. Natasha	1
Binder, Amy J.	1
Bley-Vroman, Robert	1
Bohlen, Michael J.	1
Borsuk, Alan J.	1
Bowen, Daniel	1
Bradbury, Alice	1
More ▼