ERIC - Search Results

Publication Date

In 2026	0
Since 2025	3
Since 2022 (last 5 years)	17
Since 2017 (last 10 years)	29
Since 2007 (last 20 years)	39

Descriptor

Test Items	222
Test Use	222
Test Construction	109
Test Validity	56
Scoring	49
Test Format	41
Foreign Countries	40
Achievement Tests	39
Elementary Secondary Education	38
Testing Programs	38
Educational Assessment	34
Test Reliability	34
Higher Education	32
Scores	32
Language Tests	28
Standardized Tests	28
Student Evaluation	28
Test Interpretation	28
Test Results	28
Computer Assisted Testing	27
Item Analysis	26
Item Response Theory	25
State Programs	25
Evaluation Methods	24
Multiple Choice Tests	23
More ▼

Education Level

Elementary Secondary Education	11
Elementary Education	9
Higher Education	9
Postsecondary Education	9
Secondary Education	7
Grade 6	4
Early Childhood Education	2
High Schools	2
Middle Schools	2
Primary Education	2
Grade 10	1
Grade 3	1
Grade 4	1
Grade 5	1
Intermediate Grades	1
Junior High Schools	1
Kindergarten	1
More ▼

Audience

Practitioners	41
Teachers	21
Administrators	9
Students	8
Parents	6
Researchers	5
Counselors	2
Policymakers	2
Community	1

Location

Australia	7
Canada	6
Arizona	5
New Jersey	3
Pennsylvania	3
Georgia	2
Minnesota	2
Ohio	2
Oregon	2
South Korea	2
Tennessee	2
Alabama	1
Alaska	1
China	1
Colorado	1
District of Columbia	1
Florida	1
Hong Kong	1
Idaho	1
Illinois	1
Indiana	1
Indonesia	1
Iran	1
Ireland	1
Italy (Rome)	1
More ▼

Laws, Policies, & Programs

Comprehensive Education…	2
Education Consolidation…	1
Elementary and Secondary…	1
National Defense Education Act	1
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 1 to 15 of 222 results Save | Export

Using Multiple Maximum Exposure Rates in Computerized Adaptive Testing

Peer reviewed

Direct link

Kylie Gorney; Mark D. Reckase – Journal of Educational Measurement, 2025

In computerized adaptive testing, item exposure control methods are often used to provide a more balanced usage of the item pool. Many of the most popular methods, including the restricted method (Revuelta and Ponsoda), use a single maximum exposure rate to limit the proportion of times that each item is administered. However, Barrada et al.…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks

Content Validity of Creativity Self-Report Questionnaires from PISA 2022

Peer reviewed

Direct link

B. Goecke; S. Weiss; B. Barbot – Journal of Creative Behavior, 2025

The present paper questions the content validity of the eight creativity-related self-report scales available in PISA 2022's context questionnaire and provides a set of considerations for researchers interested in using these indexes. Specifically, we point out some threats to the content validity of these scales (e.g., "creative thinking…

Descriptors: Creativity, Creativity Tests, Questionnaires, Content Validity

The Social Shapes Test as a Self-Administered, Online Measure of Social Intelligence: Two Studies with Typically Developing Adults and Adults with Autism Spectrum Disorder

Peer reviewed

Direct link

Matt I. Brown; Patrick R. Heck; Christopher F. Chabris – Journal of Autism and Developmental Disorders, 2024

The Social Shapes Test (SST) is a measure of social intelligence which does not use human faces or rely on extensive verbal ability. The SST has shown promising validity among adults without autism spectrum disorder (ASD), but it is uncertain whether it is suitable for adults with ASD. We find measurement invariance between adults with (n = 229)…

Descriptors: Interpersonal Competence, Autism Spectrum Disorders, Emotional Intelligence, Verbal Ability

Along the Convergent-Divergent Continuum: The Role of Task Structure in the PISA Creative Thinking Assessment

Peer reviewed

Direct link

Selcuk Acar; Yuyang Shen – Journal of Creative Behavior, 2025

Creativity tests, like creativity itself, vary widely in their structure and use. These differences include instructions, test duration, environments, prompt and response modalities, and the structure of test items. A key factor is task structure, referring to the specificity of the number of responses requested for a given prompt. Classic…

Descriptors: Creativity, Creative Thinking, Creativity Tests, Task Analysis

The Intent of ChatGPT Usage and Its Robustness in Medical Proficiency Exams: A Systematic Review

Peer reviewed

Direct link

Tatiana Chaiban; Zeinab Nahle; Ghaith Assi; Michelle Cherfane – Discover Education, 2024

Background: Since it was first launched, ChatGPT, a Large Language Model (LLM), has been widely used across different disciplines, particularly the medical field. Objective: The main aim of this review is to thoroughly assess the performance of the distinct version of ChatGPT in subspecialty written medical proficiency exams and the factors that…

Descriptors: Medical Education, Accuracy, Artificial Intelligence, Computer Software

Validation of the Korean Bilingual Version of the Vocabulary Size Test

Peer reviewed
PDF on ERIC

Download full text

Hae In Park – English Teaching, 2024

The present study aimed to validate a 70-item Korean bilingual version of the Vocabulary Size Test (VST) using Rasch modeling. The goal was to assess the applicability of this Korean version of the VST for Korean learners of English in an English as a foreign language (EFL) context by examining validity evidence based on Messick's framework.…

Descriptors: Korean, Bilingualism, English (Second Language), Second Language Learning

Developing Assessment Instrument Using Polytomous Response in Mathematics

Peer reviewed
PDF on ERIC

Download full text

Sutiarso, Sugeng; Rosidin, Undang; Sulistiawan, Aan – European Journal of Educational Research, 2022

This research is a developmental research aiming at developing a good mathematical test instrument using polytomous responses based on classical and modern theories. This research design uses the Plomp model, which consists of five stages, (1) preliminary investigation, (2) design, (3) realization/construction, (4) revision, and (5) implementation…

Descriptors: Mathematics Instruction, Mathematics Tests, Item Response Theory, Test Items

Developing a Whole Child School Screening Instrument: Evaluating Perceived Usability as an Initial Step in Planning for Consequential Validity

Peer reviewed

Direct link

Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – School Mental Health, 2024

We are developing the Equitable Screening to Support Youth (ESSY) Whole Child Screener to address concerns prevalent in existing school-based screenings that impede goals to advance educational equity using universal screeners. Traditional assessment development does not include end users in the early development phases, instead relying on a…

Descriptors: Screening Tests, Psychometrics, Validity, Child Development

A Special Case of Brennan's Index for Tests That Aim to Select a Limited Number of Students: A Monte Carlo Simulation Study

Peer reviewed

Direct link

Arikan, Serkan; Aybek, Eren Can – Educational Measurement: Issues and Practice, 2022

Many scholars compared various item discrimination indices in real or simulated data. Item discrimination indices, such as item-total correlation, item-rest correlation, and IRT item discrimination parameter, provide information about individual differences among all participants. However, there are tests that aim to select a very limited number…

Descriptors: Monte Carlo Methods, Item Analysis, Correlation, Individual Differences

An Investigation into a Chinese Placement Test's Score Interpretations and Uses

Direct link

Wenyue Ma – ProQuest LLC, 2023

Foreign language placement testing, an important component in university foreign language programs, has received considerable, but not copious, attention over the years in second language (L2) testing research (Norris, 2004), and it has been mostly concentrated on L2 English. In contrast to validation research on L2 English placement testing, the…

Descriptors: Second Language Learning, Chinese, Student Placement, Placement Tests

New Perspectives on IELTS Authenticity: An Evaluation of the Speaking Module

Peer reviewed
PDF on ERIC

Download full text

Marzieh Souzandehfar – International Journal of Language Testing, 2024

This study represents the inaugural attempt at assessing the authenticity of the tasks encompassed in the IELTS Speaking Module. The evaluation is conducted from the vantage points of applied linguistics and general education, and serves to enhance comprehension of authenticity and authentic assessment. In order to achieve this objective, an…

Descriptors: Speech Communication, Thinking Skills, Problem Solving, Applied Linguistics

Measurement Properties of a Standardized Elicited Imitation Test: An Integrative Data Analysis

Peer reviewed

Direct link

Isbell, Daniel R.; Son, Young-A – Studies in Second Language Acquisition, 2022

Elicited Imitation Tests (EITs) are commonly used in second language acquisition (SLA)/bilingualism research contexts to assess the general oral proficiency of study participants. While previous studies have provided valuable EIT construct-related validity evidence, some key gaps remain. This study uses an integrative data analysis to further…

Descriptors: Bilingualism, Imitation, Language Tests, Second Language Learning

Clozing the Gap: How Far Do Cloze Items Measure?

Peer reviewed

Direct link

Trace, Jonathan – Language Testing, 2020

Originally designed to measure reading and passage comprehension in L1 readers, cloze tests continue to be used for L2 assessment purposes. However, there remain disputes about whether or not cloze items can measure beyond local comprehension information, as well as whether or not they are purely a test of reading alone, or if performance can be…

Descriptors: Cloze Procedure, Second Language Learning, Reading Comprehension, Native Language

Developing a Whole Child School Screening Instrument: Evaluating Perceived Usability as an Initial Step in Planning for Consequential Validity

Peer reviewed

Direct link

Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – Grantee Submission, 2024

Descriptors: Screening Tests, Usability, Decision Making, Validity

Sustaining an Occupation-Specific Language Assessment for the Canadian Healthcare Field

Peer reviewed
PDF on ERIC

Download full text

Stewart, Gail; Strachan, Andrea – TESL Canada Journal, 2022

Since its implementation in 2004, the Canadian English Language Benchmark Assessment for Nurses (CELBAN) has been accepted as evidence of language ability for licensure of internationally educated nurses (IENs) in Canada. This article focuses on the complexities of sustaining an occupation-specific assessment over time. The authors reference the…

Descriptors: Language Tests, English for Special Purposes, Benchmarking, Nurses

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 15

Educational Measurement:…	8
Journal of Educational…	8
Educational and Psychological…	7
Applied Measurement in…	5
Ministerial Council on…	5
American Journal of…	2
Applied Psychological…	2
Grantee Submission	2
Journal of Creative Behavior	2
ProQuest LLC	2
Adolescence	1
American Language Review	1
Arithmetic Teacher	1
Assessment	1
Assessment and Accountability…	1
Bureau of Education,…	1
Center for Assessment and…	1
College Board	1
Computers & Education	1
Discover Education	1
ETS Research Report Series	1
Education Policy Analysis…	1
Educational Assessment	1
Educational Horizons	1
Educational Research Quarterly	1
More ▼

Donovan, Jenny	3
Lennon, Melissa	3
Martinez, Michael E.	3
Ackerman, Terry A.	2
Amy Briesch	2
Bennett, Randy Elliot	2
Brittany Melo	2
Cole, Nancy S.	2
Eignor, Daniel R.	2
Hutton, Penny	2
Jacqueline M. Caemmerer	2
Jessica B. Koslouski	2
Kitao, Kenji	2
Kitao, S. Kathleen	2
Lukhele, Robert	2
Morrissey, Noni	2
Nitko, Anthony J.	2
O'Connor, Gayl	2
Sandra M. Chafouleas	2
Stansfield, Charles W.	2
Thissen, David	2
Thompson, Bruce	2
Vispoel, Walter P.	2
Wainer, Howard	2
More ▼

Journal Articles	75
Reports - Research	62
Reports - Evaluative	59
Speeches/Meeting Papers	46
Reports - Descriptive	44
Guides - Non-Classroom	37
Tests/Questionnaires	24
Opinion Papers	10
Books	8
Information Analyses	6
Guides - Classroom - Teacher	5
Book/Product Reviews	4
Numerical/Quantitative Data	4
Collected Works - General	3
Dissertations/Theses -…	3
Historical Materials	3
Guides - Classroom - Learner	2
Guides - General	2
Collected Works - Proceedings	1
Dissertations/Theses -…	1
Multilingual/Bilingual…	1
Reference Materials -…	1
Reports - General	1
More ▼

National Assessment of…	9
Program for International…	6
Advanced Placement…	3
Graduate Record Examinations	2
International English…	2
Test of English as a Foreign…	2
Wechsler Intelligence Scale…	2
ACTFL Oral Proficiency…	1
California Achievement Tests	1
Career Maturity Inventory	1
Center for Epidemiologic…	1
Comprehensive Tests of Basic…	1
Differential Aptitude Test	1
Florida State Student…	1
Home Observation for…	1
Iowa Tests of Basic Skills	1
Iowa Tests of Educational…	1
National Longitudinal Survey…	1
National Teacher Examinations	1
North Carolina End of Course…	1
Peabody Picture Vocabulary…	1
Pennsylvania Educational…	1
Raven Progressive Matrices	1
Remote Associates Test	1
SAT (College Admission Test)	1
More ▼