ERIC - Search Results

Publication Date

In 2025	31
Since 2024	122
Since 2021 (last 5 years)	345
Since 2016 (last 10 years)	710
Since 2006 (last 20 years)	1190

Descriptor

Test Construction	2673
Test Items	2673
Test Validity	687
Test Reliability	550
Foreign Countries	533
Item Analysis	478
Difficulty Level	423
Multiple Choice Tests	407
Item Response Theory	391
Computer Assisted Testing	380
Test Format	360
Higher Education	332
Psychometrics	299
Item Banks	278
Elementary Secondary Education	242
Achievement Tests	237
Student Evaluation	230
Scores	224
Evaluation Methods	217
Mathematics Tests	209
Language Tests	198
Scoring	195
Factor Analysis	192
Statistical Analysis	176
Comparative Analysis	175
More ▼

Education Level

Higher Education	318
Postsecondary Education	279
Secondary Education	229
Elementary Education	209
Middle Schools	103
High Schools	96
Elementary Secondary Education	92
Junior High Schools	78
Intermediate Grades	58
Grade 8	57
Early Childhood Education	54
Grade 4	46
Grade 5	36
Primary Education	36
Grade 6	33
Grade 7	30
Grade 2	25
Grade 3	25
Kindergarten	20
Grade 1	15
Grade 12	14
Preschool Education	13
Grade 9	12
Adult Education	11
Grade 10	10
More ▼

Audience

Practitioners	155
Teachers	114
Researchers	99
Administrators	31
Students	17
Policymakers	6
Parents	4
Counselors	3
Support Staff	3

Location

Turkey	63
Australia	53
Canada	30
Florida	26
Indonesia	26
Germany	24
United Kingdom	22
United Kingdom (England)	20
China	18
Oregon	16
Japan	15
Iran	13
United States	13
California	12
Georgia	12
Netherlands	12
Taiwan	12
Nigeria	11
Texas	10
Hong Kong	9
Illinois	9
Israel	9
Massachusetts	9
South Korea	9
Delaware	8
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	16
No Child Left Behind Act 2001	10
Rehabilitation Act 1973…	4
Every Student Succeeds Act…	3
Comprehensive Education…	2
Race to the Top	2
Elementary and Secondary…	1
Elementary and Secondary…	1
Individuals with Disabilities…	1
Kentucky Education Reform Act…	1
National Defense Education Act	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1
Does not meet standards	1

Showing 1 to 15 of 2,673 results Save | Export

A Workflow for Minimizing Errors in Template-Based Automated Item-Generation Development

Peer reviewed

Direct link

Yanyan Fu – Educational Measurement: Issues and Practice, 2024

The template-based automated item-generation (TAIG) approach that involves template creation, item generation, item selection, field-testing, and evaluation has more steps than the traditional item development method. Consequentially, there is more margin for error in this process, and any template errors can be cascaded to the generated items.…

Descriptors: Error Correction, Automation, Test Items, Test Construction

Item-Writing Guidelines on Response Option Placement: A Systematic Review

Peer reviewed

Direct link

Séverin Lions; María Paz Blanco; Pablo Dartnell; Carlos Monsalve; Gabriel Ortega; Julie Lemarié – Applied Measurement in Education, 2024

Multiple-choice items are universally used in formal education. Since they should assess learning, not test-wiseness or guesswork, they must be constructed following the highest possible standards. Hundreds of item-writing guides have provided guidelines to help test developers adopt appropriate strategies to define the distribution and sequence…

Descriptors: Test Construction, Multiple Choice Tests, Guidelines, Test Items

Controlling the Speededness of Assembled Test Forms: A Generalization to the Three-Parameter Lognormal Response Time Model

Peer reviewed

Direct link

Becker, Benjamin; Weirich, Sebastian; Goldhammer, Frank; Debeer, Dries – Journal of Educational Measurement, 2023

When designing or modifying a test, an important challenge is controlling its speededness. To achieve this, van der Linden (2011a, 2011b) proposed using a lognormal response time model, more specifically the two-parameter lognormal model, and automated test assembly (ATA) via mixed integer linear programming. However, this approach has a severe…

Descriptors: Test Construction, Automation, Models, Test Items

Are the Steps on Likert Scales Equidistant? Responses on Visual Analog Scales Allow Estimating Their Distances

Peer reviewed

Direct link

Miguel A. García-Pérez – Educational and Psychological Measurement, 2024

A recurring question regarding Likert items is whether the discrete steps that this response format allows represent constant increments along the underlying continuum. This question appears unsolvable because Likert responses carry no direct information to this effect. Yet, any item administered in Likert format can identically be administered…

Descriptors: Likert Scales, Test Construction, Test Items, Item Analysis

EQGG: Automatic Question Group Generation

Peer reviewed

Direct link

Po-Chun Huang; Ying-Hong Chan; Ching-Yu Yang; Hung-Yuan Chen; Yao-Chung Fan – IEEE Transactions on Learning Technologies, 2024

Question generation (QG) task plays a crucial role in adaptive learning. While significant QG performance advancements are reported, the existing QG studies are still far from practical usage. One point that needs strengthening is to consider the generation of question group, which remains untouched. For forming a question group, intrafactors…

Descriptors: Automation, Test Items, Computer Assisted Testing, Test Construction

Optimal Calibration of Items for Multidimensional Achievement Tests

Peer reviewed

Direct link

Mahmood Ul Hassan; Frank Miller – Journal of Educational Measurement, 2024

Multidimensional achievement tests are recently gaining more importance in educational and psychological measurements. For example, multidimensional diagnostic tests can help students to determine which particular domain of knowledge they need to improve for better performance. To estimate the characteristics of candidate items (calibration) for…

Descriptors: Multidimensional Scaling, Achievement Tests, Test Items, Test Construction

Using Content Relevance and Representativeness Indices in Instrument Revision

Peer reviewed

Direct link

Anne Traynor; Sara C. Christopherson – Applied Measurement in Education, 2024

Combining methods from earlier content validity and more contemporary content alignment studies may allow a more complete evaluation of the meaning of test scores than if either set of methods is used on its own. This article distinguishes item relevance indices in the content validity literature from test representativeness indices in the…

Descriptors: Test Validity, Test Items, Achievement Tests, Test Construction

Measuring Austrian Students' Procedural Knowledge at the End of Upper Secondary Level

Peer reviewed

Direct link

Christoph Ableitinger; Christian Dorner – International Journal of Mathematical Education in Science and Technology, 2025

The number of complaints university lecturers make about a lack of knowledge, especially first-year students' procedural knowledge, has increased recently. Due to missing adequate empirical evidence, a survey of procedural knowledge among students of Austrian high schools in their final year was conducted. For this purpose, test items for…

Descriptors: Knowledge Level, Cognitive Processes, High School Seniors, Foreign Countries

Peer reviewed

Direct link

Chan Zhang; Shuaiying Cao; Minglei Wang; Jiangyan Wang; Lirui He – Field Methods, 2025

Previous research on grid questions has mostly focused on their comparability with the item-by-item method and the use of shading to help respondents navigate through a grid. This study extends prior work by examining whether lexical similarity among grid items affects how respondents answer the questions in an experiment where we manipulated…

Descriptors: Foreign Countries, Surveys, Test Construction, Design

Evaluating Youth Empowerment: The Construction and Validation of an Inventory of Dimensions and Indicators

Peer reviewed

Direct link

Anna Planas-Lladó; Xavier Úcar – American Journal of Evaluation, 2024

Empowerment is a concept that has become increasingly used over recent years. However, little research has been undertaken into how empowerment can be evaluated, particularly in the case of young people. The aim of this article is to present an inventory of dimensions and indicators of youth empowerment. The article describes the various phases in…

Descriptors: Youth, Empowerment, Test Construction, Test Validity

A Method for Generating Course Test Questions Based on Natural Language Processing and Deep Learning

Peer reviewed

Direct link

Hei-Chia Wang; Yu-Hung Chiang; I-Fan Chen – Education and Information Technologies, 2024

Assessment is viewed as an important means to understand learners' performance in the learning process. A good assessment method is based on high-quality examination questions. However, generating high-quality examination questions manually by teachers is a time-consuming task, and it is not easy for students to obtain question banks. To solve…

Descriptors: Natural Language Processing, Test Construction, Test Items, Models

An Evaluation Method for the Designing Quality of Language Item Questions in Censuses

Peer reviewed

Direct link

Haokun Liu – International Journal of Multilingualism, 2025

Globally, countries or regions across from east to west like Hong Kong, Macao, Taiwan, Singapore, the United Kingdom, and the United States have incorporated language item questions in their censuses. The assessment of such design advantages and disadvantages is crucial for academic investigation. Despite ongoing discussions, there is a noticeable…

Descriptors: Language Usage, Demography, Surveys, Questionnaires

Developing an MLA-Test for Young Learners -- Insights from Measurement Theory and Language Testing

Peer reviewed

Direct link

Kaja Haugen; Cecilie Hamnes Carlsen; Christine Möller-Omrani – Language Awareness, 2025

This article presents the process of constructing and validating a test of metalinguistic awareness (MLA) for young school children (age 8-10). The test was developed between 2021 and 2023 as part of the MetaLearn research project, financed by The Research Council of Norway. The research team defines MLA as using metalinguistic knowledge at a…

Descriptors: Language Tests, Test Construction, Elementary School Students, Metalinguistics

A Generalized Objective Function for Computer Adaptive Item Selection

Peer reviewed

Direct link

Harold Doran; Testsuhiro Yamada; Ted Diaz; Emre Gonulates; Vanessa Culver – Journal of Educational Measurement, 2025

Computer adaptive testing (CAT) is an increasingly common mode of test administration offering improved test security, better measurement precision, and the potential for shorter testing experiences. This article presents a new item selection algorithm based on a generalized objective function to support multiple types of testing conditions and…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms

An Evaluation of Automatic Item Generation: A Case Study of Weak Theory Approach

Peer reviewed

Direct link

Fu, Yanyan; Choe, Edison M.; Lim, Hwanggyu; Choi, Jaehwa – Educational Measurement: Issues and Practice, 2022

This case study applied the "weak theory" of Automatic Item Generation (AIG) to generate isomorphic item instances (i.e., unique but psychometrically equivalent items) for a large-scale assessment. Three representative instances were selected from each item template (i.e., model) and pilot-tested. In addition, a new analytical framework,…

Descriptors: Test Items, Measurement, Psychometrics, Test Construction

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 179

Journal of Educational…	84
Educational and Psychological…	77
Applied Psychological…	55
ProQuest LLC	48
Applied Measurement in…	45
Educational Measurement:…	42
Behavioral Research and…	35
ETS Research Report Series	35
Online Submission	32
Grantee Submission	25
Educational Assessment	18
Language Assessment Quarterly	18
International Journal of…	17
Language Testing	17
Journal of Psychoeducational…	16
Journal of Applied Testing…	15
International Journal of…	13
Journal of Experimental…	13
Psychometrika	13
College Board	12
International Journal of…	12
Journal of Research in…	12
Chemistry Education Research…	10
Education and Information…	10
International Association for…	10
More ▼

Tindal, Gerald	34
Alonzo, Julie	29
Hambleton, Ronald K.	24
van der Linden, Wim J.	23
Anderson, Daniel	18
Veldkamp, Bernard P.	16
Haladyna, Thomas M.	15
Park, Bitnara Jasmine	14
Reckase, Mark D.	14
Stocking, Martha L.	14
Gierl, Mark J.	13
Wainer, Howard	13
Sireci, Stephen G.	11
Roid, Gale	10
Stansfield, Charles W.	10
Schoen, Robert C.	9
Benson, Jeri	8
Berk, Ronald A.	8
Humes, Ann	8
Huntley, Renee M.	8
Irvin, P. Shawn	8
Ackerman, Terry A.	7
Bennett, Randy Elliot	7
Chang, Hua-Hua	7
More ▼

Journal Articles	1439
Reports - Research	1435
Reports - Evaluative	475
Speeches/Meeting Papers	427
Reports - Descriptive	331
Tests/Questionnaires	254
Guides - Non-Classroom	137
Information Analyses	83
Numerical/Quantitative Data	82
Opinion Papers	60
Guides - Classroom - Teacher	59
Dissertations/Theses -…	50
Books	26
Guides - General	20
ERIC Publications	15
Collected Works - General	14
Book/Product Reviews	13
ERIC Digests in Full Text	12
Guides - Classroom - Learner	9
Non-Print Media	9
Reference Materials - General	8
Reports - General	7
Reference Materials -…	6
Collected Works - Proceedings	5
Historical Materials	4
More ▼

National Assessment of…	54
SAT (College Admission Test)	32
Program for International…	26
Graduate Record Examinations	25
Test of English as a Foreign…	23
ACT Assessment	15
Advanced Placement…	15
Trends in International…	13
Law School Admission Test	12
Texas Educational Assessment…	9
Iowa Tests of Basic Skills	7
Armed Services Vocational…	6
Praxis Series	6
Delaware Student Testing…	5
Metropolitan Achievement Tests	5
National Teacher Examinations	5
Raven Progressive Matrices	5
Stanford Achievement Tests	5
Test of English for…	5
Flesch Kincaid Grade Level…	4
Georgia High School…	4
Minnesota Multiphasic…	4
Progress in International…	4
Comprehensive Tests of Basic…	3
Cornell Critical Thinking Test	3
More ▼