ERIC - Search Results

Publication Date

In 2025	1
Since 2024	4
Since 2021 (last 5 years)	7
Since 2016 (last 10 years)	17
Since 2006 (last 20 years)	29

Descriptor

Test Construction	81
Computer Assisted Testing	34
Testing Problems	25
Educational Testing	23
Elementary Secondary Education	23
Test Use	23
Test Items	20
Test Validity	19
Testing Programs	18
Educational Assessment	12
Microcomputers	12
Evaluation Methods	11
Item Banks	10
Testing	10
Achievement Tests	9
Computer Software	9
Measurement	9
Standards	9
Test Bias	9
Adaptive Testing	8
Item Analysis	8
Item Response Theory	8
State Programs	8
Psychometrics	7
School Districts	7
More ▼

Source

Educational Measurement:…

Publication Type

Journal Articles	81
Reports - Descriptive	29
Reports - Evaluative	22
Opinion Papers	19
Reports - Research	12
Information Analyses	6
Book/Product Reviews	3
Collected Works - Serials	1
Guides - Non-Classroom	1
Speeches/Meeting Papers	1
Tests/Questionnaires	1
More ▼

Education Level

Elementary Secondary Education	3
Adult Education	1
Elementary Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 9	1
High Schools	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1
More ▼

Audience

Researchers

Location

Sweden	2
Canada	1
Israel	1
Nebraska	1
Netherlands	1
New York (New York)	1
Pennsylvania	1
Singapore	1
Texas	1
United Kingdom	1
United States	1
West Virginia	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

National Assessment of…	2
Teacher Performance…	2
California Achievement Tests	1
Program for International…	1
Progress in International…	1
Stanford Achievement Tests	1
Stanford Binet Intelligence…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 81 results Save | Export

Item Response Theory Models for Polytomous Multidimensional Forced-Choice Items to Measure Construct Differentiation

Peer reviewed

Direct link

Xuelan Qiu; Jimmy de la Torre; You-Gan Wang; Jinran Wu – Educational Measurement: Issues and Practice, 2024

Multidimensional forced-choice (MFC) items have been found to be useful to reduce response biases in personality assessments. However, conventional scoring methods for the MFC items result in ipsative data, hindering the wider applications of the MFC format. In the last decade, a number of item response theory (IRT) models have been developed,…

Descriptors: Item Response Theory, Personality Traits, Personality Measures, Personality Assessment

Applying Evidence-Centered Design in the Development of a Multidimensional Adaptive Reading Motivation Measure

Peer reviewed

Direct link

Wang, Wenhao; Kingston, Neal M.; Davis, Marcia H.; Tiemann, Gail C.; Tonks, Stephen; Hock, Michael – Educational Measurement: Issues and Practice, 2021

Adaptive tests are more efficient than fixed-length tests through the use of item response theory; adaptive tests also present students questions that are tailored to their proficiency level. Although the adaptive algorithm is straightforward, developing a multidimensional computer adaptive test (MCAT) measure is complex. Evidence-centered design…

Descriptors: Evidence Based Practice, Reading Motivation, Adaptive Testing, Computer Assisted Testing

Evolving Educational Testing to Meet Students' Needs: Design-in-Real-Time Assessment

Peer reviewed

Direct link

Stephen G. Sireci; Javier Suárez-Álvarez; April L. Zenisky; Maria Elena Oliveri – Educational Measurement: Issues and Practice, 2024

The goal in personalized assessment is to best fit the needs of each individual test taker, given the assessment purposes. Design-in-Real-Time (DIRTy) assessment reflects the progressive evolution in testing from a single test, to an adaptive test, to an adaptive assessment "system." In this article, we lay the foundation for DIRTy…

Descriptors: Educational Assessment, Student Needs, Test Format, Test Construction

Exploration of Latent Structure in Test Revision and Review Log Data

Peer reviewed

Direct link

Zhang, Susu; Li, Anqi; Wang, Shiyu – Educational Measurement: Issues and Practice, 2023

In computer-based tests allowing revision and reviews, examinees' sequence of visits and answer changes to questions can be recorded. The variable-length revision log data introduce new complexities to the collected data but, at the same time, provide additional information on examinees' test-taking behavior, which can inform test development and…

Descriptors: Computer Assisted Testing, Test Construction, Test Wiseness, Test Items

Instruction-Tuned Large-Language Models for Quality Control in Automatic Item Generation: A Feasibility Study

Peer reviewed

Direct link

Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025

Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality persists to be a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…

Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation

Internet-Based Proctored Assessment: Security and Fairness Issues

Peer reviewed

Direct link

Langenfeld, Thomas – Educational Measurement: Issues and Practice, 2020

The COVID-19 pandemic has accelerated the shift toward online learning solutions necessitating the need for developing online assessment solutions. Vendors offer online assessment delivery systems with varying security levels designed to minimize unauthorized behaviors. Combating cheating and securing assessment content, however, is not solely the…

Descriptors: Computer Assisted Testing, Justice, COVID-19, Pandemics

Generating Performance-Level Descriptors under a Principled Assessment Design Paradigm: An Example for Assessments under the Next-Generation Science Standards

Peer reviewed

Direct link

Luecht, Richard M. – Educational Measurement: Issues and Practice, 2020

The educational testing landscape is changing in many significant ways as evidence-based, principled assessment design (PAD) approaches are formally adopted. This article discusses the challenges and presents some score scale- and task-focused strategies for developing useful performance-level descriptors (PLDs) under a PAD approach. Details of…

Descriptors: Test Construction, Academic Standards, Science Education, Educational Testing

Reframing Research and Assessment Practices: Advancing an Antiracist and Anti-Ableist Research Agenda

Peer reviewed

Direct link

Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024

Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…

Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement

Adding Objectivity to Standard Setting: Evaluating Consequence Using the Conscious and Subconscious Weight Methods

Peer reviewed

Direct link

Leventhal, Brian C.; Grabovsky, Irina – Educational Measurement: Issues and Practice, 2020

Standard setting is arguably one of the most subjective techniques in test development and psychometrics. The decisions when scores are compared to standards, however, are arguably the most consequential outcomes of testing. Providing licensure to practice in a profession has high stake consequences for the public. Denying graduation or forcing…

Descriptors: Standard Setting (Scoring), Weighted Scores, Test Construction, Psychometrics

Examining Effectiveness and Validity of Accommodations for English Language Learners in Mathematics: An Evidence-Based Computer Accommodation Decision System

Peer reviewed

Direct link

Abedi, Jamal; Zhang, Yu; Rowe, Susan E.; Lee, Hansol – Educational Measurement: Issues and Practice, 2020

Research indicates that the performance-gap between English Language Learners (ELLs) and their non-ELL peers is partly due to ELLs' difficulty in understanding assessment language. Accommodations have been shown to narrow this performance-gap, but many accommodations studies have not used a randomized design and are based on relatively small…

Descriptors: English Language Learners, Achievement Gap, Mathematics Tests, Standards

The Effect of Drag-and-Drop Item Features on Test-Taker Performance and Response Strategies

Peer reviewed

Direct link

Arslan, Burcu; Jiang, Yang; Keehner, Madeleine; Gong, Tao; Katz, Irvin R.; Yan, Fred – Educational Measurement: Issues and Practice, 2020

Computer-based educational assessments often include items that involve drag-and-drop responses. There are different ways that drag-and-drop items can be laid out and different choices that test developers can make when designing these items. Currently, these decisions are based on experts' professional judgments and design constraints, rather…

Descriptors: Test Items, Computer Assisted Testing, Test Format, Decision Making

Preparing the Next Generation of Educational Measurement Specialists: A Call for Programs with an Integrated Scope and Sequence

Peer reviewed

Direct link

Russell, Mike; Ludlow, Larry; O'Dwyer, Laura – Educational Measurement: Issues and Practice, 2019

The field of educational measurement has evolved considerably since the first doctoral programs were established. In response, programs have typically tacked on courses that address newly developed theories, methods, tools, and techniques. As our review of current programs evidences, this approach produces artificial distinctions among topics and…

Descriptors: Educational Testing, Specialists, Doctoral Programs, Program Evaluation

Development and Validation of an Automatic Item Generation System for English Idioms

Peer reviewed

Direct link

Rafatbakhsh, Elaheh; Ahmadi, Alireza; Moloodi, Amirsaeid; Mehrpour, Saeed – Educational Measurement: Issues and Practice, 2021

Test development is a crucial, yet difficult and time-consuming part of any educational system, and the task often falls all on teachers. Automatic item generation systems have recently drawn attention as they can reduce this burden and make test development more convenient. Such systems have been developed to generate items for vocabulary,…

Descriptors: Test Construction, Test Items, Computer Assisted Testing, Multiple Choice Tests

Digital Module 09: Sociocognitive Assessment for Diverse Populations

Peer reviewed

Direct link

Mislevy, Robert J.; Oliveri, Maria Elena – Educational Measurement: Issues and Practice, 2019

In this digital ITEMS module, Dr. Robert [Bob] Mislevy and Dr. Maria Elena Oliveri introduce and illustrate a sociocognitive perspective on educational measurement, which focuses on a variety of design and implementation considerations for creating fair and valid assessments for learners from diverse populations with diverse sociocultural…

Descriptors: Educational Testing, Reliability, Test Validity, Test Reliability

Digital Module 10: Rasch Measurement Theory

Peer reviewed

Direct link

Wang, Jue; Engelhard, George, Jr. – Educational Measurement: Issues and Practice, 2019

In this digital ITEMS module, Dr. Jue Wang and Dr. George Engelhard Jr. describe the Rasch measurement framework for the construction and evaluation of new measures and scales. From a theoretical perspective, they discuss the historical and philosophical perspectives on measurement with a focus on Rasch's concept of specific objectivity and…

Descriptors: Item Response Theory, Evaluation Methods, Measurement, Goodness of Fit

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Fremer, John	2
Hiscox, Michael D.	2
Mislevy, Robert J.	2
Nichols, Paul D.	2
Stone, Clement A.	2
Abedi, Jamal	1
Ahmadi, Alireza	1
Anderson, Beverly L.	1
Angela Johnson	1
April L. Zenisky	1
Arffman, Inga	1
Armstrong, Anne-Marie	1
Arslan, Burcu	1
Averitt, Jason	1
Balizet, Sha	1
Bond, Lloyd	1
Bottsford-Miller, Nicole A.	1
Brookhart, Susan M.	1
Brzezinski, Evelyn J.	1
Burling, Kelly S.	1
Capie, William	1
Carter, Kathy	1
Chudowsky, Naomi	1
Cole, Nancy S.	1
More ▼