ERIC - Search Results

Showing 1 to 15 of 17 results

Practical Considerations in Item Calibration with Small Samples under Multistage Test Design: A Case Study. Research Report. ETS RR-24-03

Peer reviewed

Hongwen Guo; Matthew S. Johnson; Daniel F. McCaffrey; Lixiong Gu – ETS Research Report Series, 2024

The multistage testing (MST) design has been gaining attention and popularity in educational assessments. For testing programs that have small test-taker samples, it is challenging to calibrate new items to replenish the item pool. In the current research, we used the item pools from an operational MST program to illustrate how research studies…

Descriptors: Test Items, Test Construction, Sample Size, Scaling

The Use of Process Data in Large-Scale Assessments: A Literature Review

Peer reviewed

Ella Anghel; Lale Khorramdel; Matthias von Davier – Large-scale Assessments in Education, 2024

As the use of process data in large-scale educational assessments is becoming more common, it is clear that data on examinees' test-taking behaviors can illuminate their performance, and can have crucial ramifications concerning assessments' validity. A thorough review of the literature in the field may inform researchers and practitioners of…

Descriptors: Educational Assessment, Test Validity, Test Items, Reaction Time

The Utility and Limitations of the New Ecological Paradigm Scale for Children

Peer reviewed

Rosa, Claudio D.; Collado, Silvia; Larson, Lincoln R. – Journal of Environmental Education, 2022

The New Ecological Paradigm (NEP) scale adapted for use with children (NEP-C) is one of the most frequently used measures of children's environmental beliefs. Though widely utilized, the limitations of the NEP-C instrument are often overlooked. Based on a systematic synthesis of existing literature examining the NEP-C, we argue that the scale…

Descriptors: Attitude Measures, Children, Environment, Beliefs

The Intent of ChatGPT Usage and Its Robustness in Medical Proficiency Exams: A Systematic Review

Peer reviewed

Tatiana Chaiban; Zeinab Nahle; Ghaith Assi; Michelle Cherfane – Discover Education, 2024

Background: Since it was first launched, ChatGPT, a Large Language Model (LLM), has been widely used across different disciplines, particularly the medical field. Objective: The main aim of this review is to thoroughly assess the performance of the distinct version of ChatGPT in subspecialty written medical proficiency exams and the factors that…

Descriptors: Medical Education, Accuracy, Artificial Intelligence, Computer Software

Measurement Properties of a Standardized Elicited Imitation Test: An Integrative Data Analysis

Peer reviewed

Isbell, Daniel R.; Son, Young-A – Studies in Second Language Acquisition, 2022

Elicited Imitation Tests (EITs) are commonly used in second language acquisition (SLA)/bilingualism research contexts to assess the general oral proficiency of study participants. While previous studies have provided valuable EIT construct-related validity evidence, some key gaps remain. This study uses an integrative data analysis to further…

Descriptors: Bilingualism, Imitation, Language Tests, Second Language Learning

Supplement to "Score Comparability across Computerized Assessment Delivery Devices": An Update on Literature Produced since the June 2016 Report

Council of Chief State School Officers, 2020

Any body of research evolves over time. Previous understandings become more nuanced, ideas are supported or rebuked, and, eventually we arrive at a clearer view of the issue. The research on score comparability across computerized devices is no exception. CCSSO [Council of Chief State School Officers] and the Center for Assessment have published…

Descriptors: Computer Assisted Testing, Scores, Intermode Differences, Influence of Technology

The Comparability of Scores from Different Digital Devices: A Literature Review and Synthesis with Recommendations for Practice

Peer reviewed

Dadey, Nathan; Lyons, Susan; DePascale, Charles – Applied Measurement in Education, 2018

Evidence of comparability is generally needed whenever there are variations in the conditions of an assessment administration, including variations introduced by the administration of an assessment on multiple digital devices (e.g., tablet, laptop, desktop). This article is meant to provide a comprehensive examination of issues relevant to the…

Descriptors: Evaluation Methods, Computer Assisted Testing, Educational Technology, Technology Uses in Education

Assessment Accommodations on Tests of Academic Achievement for Students Who Are Deaf or Hard of Hearing: A Qualitative Meta-Analysis of the Research Literature

Peer reviewed

Cawthon, Stephanie; Leppo, Rachel – American Annals of the Deaf, 2013

The authors conducted a qualitative meta-analysis of the research on assessment accommodations for students who are deaf or hard of hearing. There were 16 identified studies that analyzed the impact of factors related to student performance on academic assessments across different educational settings, content areas, and types of assessment…

Descriptors: Testing Accommodations, Academic Achievement, Deafness, Hearing Impairments

Assessment Accommodations on Tests of Academic Achievement for Students Who Are Deaf or Hard of Hearing: A Qualitative Meta-Analysis of the Research Literature

Peer reviewed

Cawthon, Stephanie; Leppo, Rachel – Grantee Submission, 2013

Descriptors: Testing Accommodations, Academic Achievement, Deafness, Hearing Impairments

Computerized Adaptive Personality Testing: A Review and Illustration With the MMPI-2 Computerized Adaptive Version.

Peer reviewed

Forbey, Johnathan D.; Ben-Porath, Yossef S. – Psychological Assessment, 2007

Computerized adaptive testing in personality assessment can improve efficiency by significantly reducing the number of items administered to answer an assessment question. Two approaches have been explored for adaptive testing in computerized personality assessment: item response theory and the countdown method. In this article, the authors…

Descriptors: Personality Traits, Computer Assisted Testing, Test Validity, Personality Assessment

A Supplement to "The Number of Guttman Errors as a Simple and Powerful Person-Fit Statistic."

Peer reviewed

Meijer, Rob R. – Applied Psychological Measurement, 1995

A statistic used by R. Meijer (1994) to determine person-fit referred to the number of errors from the deterministic Guttman model (L. Guttman, 1950), but this was, in fact, based on the number of errors from the deterministic Guttman model as defined by J. Loevinger (1947, 1948). (SLD)

Descriptors: Difficulty Level, Models, Responses, Scaling

Methodology Review: Nonparametric IRT Approaches to the Analysis of Dichotomous Item Scores.

Peer reviewed

Sijtsma, Klaas – Applied Psychological Measurement, 1998

Reviews developments in nonparametric item-response theory (NIRT), from its historic origins in item-response theory (IRT) and scale analysis to new theoretical results for practical test construction. Discusses theoretical results from NIRT often relevant to IRT. Contains 134 references. (SLD)

Descriptors: Item Response Theory, Nonparametric Statistics, Research Methodology, Scores

The Reliability and Validity of the GED Tests. GED Research Brief No. 6.

Whitney, Douglas R.; And Others – 1985

This research brief summarizes the reliability and validity data available in, but spread throughout, a number of General Educational Development (GED) Testing Service publications. A section on reliability discusses how to determine the reliability of a test's scores and two ways of assessing the reliability of a test--internal consistency…

Descriptors: Adult Education, High School Equivalency Programs, Item Analysis, Scores

Testing with Multiple-Choice Items.

Peer reviewed

Aiken, Lewis R. – Journal of Research and Development in Education, 1987

A critical review is presented of research conducted during the past 20 years on multiple-choice tests of achievement and aptitude. The design and use of multiple-choice tests is emphasized, but information concerning the socioeducational implications of relying on such tests is also included. (Author/CB)

Descriptors: Academic Achievement, Academic Aptitude, Educational Sociology, Multiple Choice Tests

A Redefinition of Content Validity.

Peer reviewed

Benson, Jeri – Educational and Psychological Measurement, 1981

A review of the research on item writing, item format, test instructions, and item readability indicated the importance of instrument structure in the interpretation of test data. The effect of failing to consider these areas on the content validity of achievement test scores is discussed. (Author/GK)

Descriptors: Achievement Tests, Elementary Secondary Education, Literature Reviews, Scores
