ERIC - Search Results

Publication Date

In 2026	0
Since 2025	5
Since 2022 (last 5 years)	45
Since 2017 (last 10 years)	91
Since 2007 (last 20 years)	144

Descriptor

Test Format	418
Test Reliability	418
Test Validity	243
Test Construction	135
Test Items	119
Higher Education	88
Multiple Choice Tests	68
Foreign Countries	67
Testing	65
Test Interpretation	61
Comparative Analysis	57
Language Tests	57
Computer Assisted Testing	55
Scores	53
Scoring	51
Student Evaluation	46
Psychometrics	44
Test Use	44
Standardized Tests	43
Elementary Secondary Education	40
Item Analysis	40
Test Content	40
College Students	36
Second Language Learning	36
Test Reviews	36
More ▼

Education Level

Higher Education	50
Postsecondary Education	42
Secondary Education	25
Elementary Education	24
Middle Schools	17
Junior High Schools	15
High Schools	10
Grade 8	9
Grade 7	8
Early Childhood Education	7
Elementary Secondary Education	7
Grade 3	7
Grade 5	7
Intermediate Grades	7
Grade 4	6
Grade 6	6
Primary Education	6
Adult Education	2
Kindergarten	2
Grade 1	1
Grade 9	1
Preschool Education	1
More ▼

Audience

Practitioners	33
Teachers	23
Administrators	18
Researchers	12
Community	1
Counselors	1
Policymakers	1
Students	1
Support Staff	1

Location

New York	9
Turkey	8
California	7
Canada	6
Japan	6
Germany	4
United Kingdom	4
Georgia	3
Israel	3
France	2
Indonesia	2
Iran	2
Netherlands	2
New York (New York)	2
Nigeria	2
Singapore	2
South Africa	2
United Kingdom (Great Britain)	2
Bangladesh	1
Brazil	1
China	1
Connecticut	1
Czech Republic	1
Estonia	1
Finland	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	1
Job Training Partnership Act…	1
No Child Left Behind Act 2001	1
Pell Grant Program	1

What Works Clearinghouse Rating

Showing 1 to 15 of 418 results Save | Export

A Review of Automatic Item Generation Techniques Leveraging Large Language Models

Peer reviewed
PDF on ERIC

Download full text

Bin Tan; Nour Armoush; Elisabetta Mazzullo; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2025

This study reviews existing research on the use of large language models (LLMs) for automatic item generation (AIG). We performed a comprehensive literature search across seven research databases, selected studies based on predefined criteria, and summarized 60 relevant studies that employed LLMs in the AIG process. We identified the most commonly…

Descriptors: Artificial Intelligence, Test Items, Automation, Test Format

The MSRT: A Critical Review of English Proficiency in Iran

Peer reviewed

Direct link

Muhammed Parviz; Masoud Azizi – Discover Education, 2025

This article offers a critical review of the Ministry of Science, Research, and Technology English Proficiency Test (MSRT), a high-stakes exam required for postgraduate graduation, scholarships, and certain employment positions in Iran. Despite its widespread use, the design and implementation of the MSRT raise concerns about its validity and…

Descriptors: Language Tests, Language Proficiency, English (Second Language), Second Language Learning

A Comparison of Yen's Q3 Coefficient and Rasch Testlet Modeling for Identifying Local Item Dependence: Evidence from Two Vocabulary Matching Tests

Peer reviewed

Direct link

Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025

This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…

Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis

2023-2024 NSCAS Growth: English Language Arts, Mathematics, and Science Technical Report

Download full text

Nebraska Department of Education, 2024

The Nebraska Student-Centered Assessment System (NSCAS) is a statewide assessment system that embodies Nebraska's holistic view of students and helps them prepare for success in postsecondary education, career, and civic life. It uses multiple measures throughout the year to provide educators and decision-makers at all levels with the insights…

Descriptors: Student Evaluation, Evaluation Methods, Elementary School Students, Middle School Students

Selecting Technically Adequate Tests

Peer reviewed

Direct link

Susan K. Johnsen – Gifted Child Today, 2024

The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…

Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity

New York State Testing Program: Grades 6 and 7 English Language Arts Paper-Based Tests. Teacher's Directions. Spring 2024

Download full text

New York State Education Department, 2024

The New York State Education Department (NYSED) has a partnership with NWEA for the development of the 2024 Grades 3-8 English Language Arts Tests. Teachers from across the State work with NYSED in a variety of activities to ensure the validity and reliability of the New York State Testing Program (NYSTP). The 2024 Grades 6 and 7 English Language…

Descriptors: Language Tests, Test Format, Language Arts, English Instruction

The Effects of Reverse Items on Psychometric Properties and Respondents' Scale Scores According to Different Item Reversal Strategies

Peer reviewed
PDF on ERIC

Download full text

Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024

This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale in the research were developed: 10 positive items were integrated in the first form (Form-P) and five positive and five reverse items in the other two…

Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)

Meta[superscript 2]: A Meta-Analysis and Psychometric Evaluation of the Metacognitive Awareness Inventory (MAI) in the Context of Health Professions Education

Peer reviewed

Direct link

Andrew S. Cale; Elizabeth R. Agosto; Brenda Kucha Anak Ganeng; Megan E. Kruskie; Margaret A. McNulty; Kyle A. Robertson; Cecelia J. Vetter; Sabrina C. Woods; Md. Nazmul Karim; Adam B. Wilson – Anatomical Sciences Education, 2025

To keep pace with medicine's unpredictable changes, medical trainees must learn to accurately monitor and evaluate themselves via metacognition (i.e., thinking about thinking). The Metacognitive Awareness Inventory (MAI) can assess and guide the metacognitive development of trainees. This study summarizes existing psychometric evidence and…

Descriptors: Meta Analysis, Psychometrics, Metacognition, Measures (Individuals)

A Two-Level Adaptive Test Battery

Peer reviewed

Direct link

Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024

A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…

Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability

The DAATS Battery Short Form as a Measure of Teacher Dispositions

Peer reviewed
PDF on ERIC

Download full text

Judy R. Wilkerson; W. Steve Lang; LaSonya Moore – Journal of Research in Education, 2025

The DAATS (Dispositions Assessments Aligned with Teacher Standards) battery is a series of five instruments of different item types that measure teachers' consistency with the critical dispositions embedded in the InTASC Standards. The purpose of this study was to continue a 20-year research project on the development and implementation of…

Descriptors: Educational Assessment, National Standards, Teacher Evaluation, Teacher Competencies

Measuring Mathematical Skills in Early Childhood: A Systematic Review of the Psychometric Properties of Early Maths Assessments and Screeners

Peer reviewed

Direct link

Laura A. Outhwaite; Pirjo Aunio; Jaimie Ka Yu Leung; Jo Van Herwegen – Educational Psychology Review, 2024

Successful early mathematical development is vital to children's later education, employment, and wellbeing outcomes. However, established measurement tools are infrequently used to (i) assess children's mathematical skills and (ii) identify children with or at-risk of mathematical learning difficulties. In response, this pre-registered systematic…

Descriptors: Mathematics Tests, Screening Tests, Mathematics Skills, At Risk Students

Charting the Future of Assessments. Research Report. ETS RR-24-13

Peer reviewed
PDF on ERIC

Download full text

Patrick Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Report Series, 2024

Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…

Descriptors: Assessment Literacy, Testing, Test Bias, Test Construction

Do Different Devices Perform Equally Well with Different Numbers of Scale Points and Response Formats? A Test of Measurement Invariance and Reliability

Peer reviewed

Direct link

Natalja Menold; Vera Toepoel – Sociological Methods & Research, 2024

Research on mixed devices in web surveys is in its infancy. Using a randomized experiment, we investigated device effects (desktop PC, tablet and mobile phone) for six response formats and four different numbers of scale points. N = 5,077 members of an online access panel participated in the experiment. An exact test of measurement invariance and…

Descriptors: Online Surveys, Handheld Devices, Telecommunications, Test Reliability

New York State Testing Program: English Language Arts, Mathematics, and Science Tests. School Administrator's Manual, 2024. Grades 3-8

Download full text

New York State Education Department, 2024

The instructions in this manual explain the responsibilities of school administrators for the New York State Testing Program (NYSTP) Grades 3-8 English Language Arts, Mathematics, and Grades 5 & 8 Science Tests. School administrators must be thoroughly familiar with the contents of the manual, and the policies and procedures must be followed…

Descriptors: Testing Programs, Language Arts, Mathematics Tests, Science Tests

Innovations in Assessing Students' Digital Literacy Skills in Learning Science: Effective Multiple Choice Closed-Ended Tests Using Rasch Model

Peer reviewed
PDF on ERIC

Download full text

Fitria Lafifa; Dadan Rosana – Turkish Online Journal of Distance Education, 2024

This research goal to develop a multiple-choice closed-ended test to assessing and evaluate students' digital literacy skills. The sample in this study were students at MTsN 1 Blitar City who were selected using a purposive sampling technique. The test was also validated by experts, namely 2 Doctors of Physics and Science from Yogyakarta State…

Descriptors: Educational Innovation, Student Evaluation, Digital Literacy, Multiple Choice Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 28

Diagnostique	26
Educational and Psychological…	22
Journal of Educational…	9
Language Testing	9
New York State Education…	9
Psychological Assessment	7
Applied Psychological…	5
ETS Research Report Series	5
International Journal of…	5
Journal of Reading	5
Language Assessment Quarterly	5
Applied Measurement in…	4
Assessment	4
ProQuest LLC	4
Assessment & Evaluation in…	3
Assessment for Effective…	3
College Board	3
Evaluation and the Health…	3
Grantee Submission	3
Journal of Experimental…	3
Journal of Psychoeducational…	3
Perceptual and Motor Skills	3
Practical Assessment,…	3
Academic Medicine	2
Annual Review of Applied…	2
More ▼

White, Edward M.	6
Melancon, Janet G.	4
Thompson, Bruce	4
Trevisan, Michael S.	4
Federico, Pat-Anthony	3
Frisbie, David A.	3
Hambleton, Ronald K.	3
Sax, Gilbert	3
Stansfield, Charles W.	3
Straus, Murray A.	3
Aiken, Lewis R.	2
Alderson, J. Charles	2
Brown, James Dean	2
Bush, Martin	2
Conoyer, Sarah J.	2
Eignor, Daniel R.	2
Green, Kathy	2
Hamby, Sherry L.	2
Hendrickson, Amy	2
Henk, William A.	2
Henning, Grant	2
Kapes, Jerome T.	2
Liskin-Gasparro, Judith E.	2
Menold, Natalja	2
More ▼

Journal Articles	265
Reports - Research	239
Speeches/Meeting Papers	63
Reports - Descriptive	61
Reports - Evaluative	57
Information Analyses	25
Opinion Papers	24
Guides - Non-Classroom	21
Tests/Questionnaires	20
Guides - Classroom - Teacher	10
Guides - General	6
Numerical/Quantitative Data	5
Dissertations/Theses -…	4
Reference Materials -…	4
Books	3
ERIC Publications	1
Guides - Classroom - Learner	1
Non-Print Media	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	5
Embedded Figures Test	3
SAT (College Admission Test)	3
Wechsler Adult Intelligence…	3
Wechsler Intelligence Scale…	3
ACT Assessment	2
Beck Depression Inventory	2
Graduate Record Examinations	2
Minnesota Multiphasic…	2
Peabody Picture Vocabulary…	2
ACTFL Oral Proficiency…	1
Armed Services Vocational…	1
Attribution Style…	1
Behavior Assessment System…	1
Bem Sex Role Inventory	1
Bruininks Oseretsky Test of…	1
California Critical Thinking…	1
Canfield Learning Styles…	1
Computer Attitude Scale	1
Conflict Tactics Scale	1
Conners Rating Scales	1
Cornell Critical Thinking Test	1
Defining Issues Test	1
Developmental Indicators for…	1
Dimensions of Self Concept	1
More ▼