ERIC - Search Results

Publication Date

In 2025	1
Since 2024	5
Since 2021 (last 5 years)	12
Since 2016 (last 10 years)	31
Since 2006 (last 20 years)	61

Descriptor

Simulation	201
Test Construction	201
Test Items	85
Computer Assisted Testing	64
Item Response Theory	48
Adaptive Testing	47
Scoring	26
Item Banks	25
Test Validity	25
Comparative Analysis	24
Measurement Techniques	24
Higher Education	21
Item Analysis	21
Evaluation Methods	20
Mathematical Models	19
Psychometrics	19
Test Reliability	18
Ability	17
Difficulty Level	17
Estimation (Mathematics)	16
Models	16
Statistical Analysis	16
Selection	15
Student Evaluation	15
Test Format	15
More ▼

Education Level

Postsecondary Education	7
Secondary Education	6
Higher Education	5
Middle Schools	3
Elementary Secondary Education	2
High Schools	2
Junior High Schools	2
Adult Education	1
Elementary Education	1
Grade 6	1
Intermediate Grades	1
More ▼

Audience

Practitioners	3
Administrators	1
Researchers	1
Teachers	1

Location

Germany	2
Israel	2
Alabama	1
Arkansas	1
California	1
Denmark	1
Iran	1
Maryland	1
Mexico	1
New Zealand	1
Nigeria	1
Spain (Madrid)	1
Taiwan	1
United Kingdom	1
United Kingdom (England)	1
Virginia	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	3
Iowa Tests of Basic Skills	2
National Assessment of…	2
Work Keys (ACT)	2
Advanced Placement…	1
COMPASS (Computer Assisted…	1
Law School Admission Test	1
Program for International…	1
SAT (College Admission Test)	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 201 results Save | Export

Evaluating German PISA Stratification Designs: A Simulation Study

Peer reviewed

Direct link

Julia Mang; Helmut Küchenhoff; Sabine Meinck – Large-scale Assessments in Education, 2024

Stratification is an important design feature of many studies using complex sampling designs and it is often used in large-scale assessment (LSA) studies, such as the "Programme for International Student Assessment" (PISA), for two main reasons. First, stratification variables that achieve a high between and low within strata variance…

Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students

A Comparison of Final Scoring Methods under the Multistage Adaptive Testing Framework

Direct link

Hacer Karamese – ProQuest LLC, 2022

Multistage adaptive testing (MST) has become popular in the testing industry because the research has shown that it combines the advantages of both linear tests and item-level computer adaptive testing (CAT). The previous research efforts primarily focused on MST design issues such as panel design, module length, test length, distribution of test…

Descriptors: Adaptive Testing, Scoring, Computer Assisted Testing, Design

Comparison of Item Response Theory Ability and Item Parameters According to Classical and Bayesian Estimation Methods

Peer reviewed
PDF on ERIC

Download full text

Eray Selçuk; Ergül Demir – International Journal of Assessment Tools in Education, 2024

This research aims to compare the ability and item parameter estimations of Item Response Theory according to Maximum likelihood and Bayesian approaches in different Monte Carlo simulation conditions. For this purpose, depending on the changes in the priori distribution type, sample size, test length, and logistics model, the ability and item…

Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation

Practical Considerations in Choosing an Anchor Test Form for Equating under the Random Groups Design

Peer reviewed

Direct link

Cui, Zhongmin; He, Yong – Measurement: Interdisciplinary Research and Perspectives, 2023

Careful considerations are necessary when there is a need to choose an anchor test form from a list of old test forms for equating under the random groups design. The choice of the anchor form potentially affects the accuracy of equated scores on new test forms. Few guidelines, however, can be found in the literature on choosing the anchor form.…

Descriptors: Test Format, Equated Scores, Best Practices, Test Construction

Computerized Multistage Testing: Principles, Designs and Practices with R

Peer reviewed

Direct link

Yigiter, Mahmut Sami; Dogan, Nuri – Measurement: Interdisciplinary Research and Perspectives, 2023

In recent years, Computerized Multistage Testing (MST), with their versatile benefits, have found themselves a wide application in large scale assessments and have increased their popularity. The fact that forms can be made ready before the exam application, such as a linear test, and that they can be adapted according to the test taker's ability…

Descriptors: Programming Languages, Monte Carlo Methods, Computer Assisted Testing, Test Format

Being Empathic in Complex Situations in Intercultural Education: A Practical Tool

Peer reviewed

Direct link

Landler-Pardo, Gabriella; Arviv Elyashiv, Rinat; Levi-Keren, Michal; Weinberger, Yehudith – Intercultural Education, 2022

Empathy, being multidimensional in nature, addresses cognitive, social-emotional, and behavioural components of interpersonal interaction. It is considered a core element of global competence. As schools become more diverse, empathy, which expresses the ability to observe social situations from other people's points of view plays a critical…

Descriptors: Empathy, Multicultural Education, Global Approach, Interpersonal Relationship

Capturing Competence: The Design, Evaluation, and Implementation of a Video-Based Instrument for Assessing Verbal Aggression Management Competence

Peer reviewed

Direct link

Delphine Franco; Ruben Vanderlinde; Martin Valcke – European Journal of Education, 2025

Complex competences, such as managing students' aggressive behaviour, are challenging to develop during teacher training. Recently, video-based simulations have been considered promising, yet suitable assessment instruments are limitedly available. This paper reports on the design and evaluation of a video-based assessment tool tailored to measure…

Descriptors: Preservice Teachers, Preservice Teacher Education, Student Behavior, Aggression

An Investigation of Item Calibration Methods in Multistage Testing

Peer reviewed

Direct link

Cai, Liuhan; Albano, Anthony D.; Roussos, Louis A. – Measurement: Interdisciplinary Research and Perspectives, 2021

Multistage testing (MST), an adaptive test delivery mode that involves algorithmic selection of predefined item modules rather than individual items, offers a practical alternative to linear and fully computerized adaptive testing. However, interactions across stages between item modules and examinee groups can lead to challenges in item…

Descriptors: Adaptive Testing, Test Items, Item Response Theory, Test Construction

Efficiency of Targeted Multistage Calibration Designs under Practical Constraints: A Simulation Study

Peer reviewed

Direct link

Berger, Stéphanie; Verschoor, Angela J.; Eggen, Theo J. H. M.; Moser, Urs – Journal of Educational Measurement, 2019

Calibration of an item bank for computer adaptive testing requires substantial resources. In this study, we investigated whether the efficiency of calibration under the Rasch model could be enhanced by improving the match between item difficulty and student ability. We introduced targeted multistage calibration designs, a design type that…

Descriptors: Simulation, Computer Assisted Testing, Test Items, Difficulty Level

Using Existing Data to Inform Development of New Item Types. Research Report. ETS RR-20-01

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Ling, Guangming; Frankel, Lois – ETS Research Report Series, 2020

With advances in technology, researchers and test developers are developing new item types to measure complex skills like problem solving and critical thinking. Analyzing such items is often challenging because of their complicated response patterns, and thus it is important to develop psychometric methods for practitioners and researchers to…

Descriptors: Test Construction, Test Items, Item Analysis, Psychometrics

Chance-Constrained Automated Test Assembly

Peer reviewed

Direct link

Giada Spaccapanico Proietti; Mariagiulia Matteucci; Stefania Mignani; Bernard P. Veldkamp – Journal of Educational and Behavioral Statistics, 2024

Classical automated test assembly (ATA) methods assume fixed and known coefficients for the constraints and the objective function. This hypothesis is not true for the estimates of item response theory parameters, which are crucial elements in test assembly classical models. To account for uncertainty in ATA, we propose a chance-constrained…

Descriptors: Automation, Computer Assisted Testing, Ambiguity (Context), Item Response Theory

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Developing Multistage Tests Using "D"-Scoring Method

Peer reviewed

Direct link

Han, Kyung T.; Dimitrov, Dimiter M.; Al-Mashary, Faisal – Educational and Psychological Measurement, 2019

The "D"-scoring method for scoring and equating tests with binary items proposed by Dimitrov offers some of the advantages of item response theory, such as item-level difficulty information and score computation that reflects the item difficulties, while retaining the merits of classical test theory such as the simplicity of number…

Descriptors: Test Construction, Scoring, Test Items, Adaptive Testing

Bias and Bias Correction Method for Nonproportional Abilities Requirement (NPAR) Tests

Peer reviewed

Direct link

Ip, Edward H.; Strachan, Tyler; Fu, Yanyan; Lay, Alexandra; Willse, John T.; Chen, Shyh-Huei; Rutkowski, Leslie; Ackerman, Terry – Journal of Educational Measurement, 2019

Test items must often be broad in scope to be ecologically valid. It is therefore almost inevitable that secondary dimensions are introduced into a test during test development. A cognitive test may require one or more abilities besides the primary ability to correctly respond to an item, in which case a unidimensional test score overestimates the…

Descriptors: Test Items, Test Bias, Test Construction, Scores

Item Calibration Methods with Multiple Subscale Multistage Testing

Peer reviewed

Direct link

Chun Wang; Ping Chen; Shengyu Jiang – Journal of Educational Measurement, 2020

Many large-scale educational surveys have moved from linear form design to multistage testing (MST) design. One advantage of MST is that it can provide more accurate latent trait [theta] estimates using fewer items than required by linear tests. However, MST generates incomplete response data by design; hence, questions remain as to how to…

Descriptors: Test Construction, Test Items, Adaptive Testing, Maximum Likelihood Statistics

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 14

Applied Psychological…	15
Journal of Educational…	14
Educational and Psychological…	9
Journal of Educational and…	6
Academic Medicine	5
Applied Measurement in…	5
Multivariate Behavioral…	5
ProQuest LLC	5
ETS Research Report Series	4
Measurement:…	3
IGI Global	2
Journal of Personnel…	2
Studies in Educational…	2
Alberta Journal of…	1
Australian Review of Applied…	1
Cogent Education	1
College Board	1
Education and Information…	1
Educational Researcher	1
Educational Technology &…	1
European Journal of Education	1
Evaluation and the Health…	1
Higher Education Quarterly	1
IAP - Information Age…	1
Intercultural Education	1
More ▼

Weiss, David J.	6
Reckase, Mark D.	5
Davey, Tim	4
Hambleton, Ronald K.	4
Schnipke, Deborah L.	4
Stocking, Martha L.	4
Berger, Martijn P. F.	3
Chang, Hua-Hua	3
Harris, Deborah J.	3
Meijer, Rob R.	3
Reese, Lynda M.	3
van der Linden, Wim J.	3
Almond, Russell G.	2
Antal, Judit	2
Baker, Eva L.	2
Betz, Nancy E.	2
Chung, Gregory K. W. K.	2
Eignor, Daniel R.	2
Finkelman, Matthew D.	2
Guo, Hongwen	2
Hanson, Bradley A.	2
Hau, Kit-Tai	2
Kim, Wonsuk	2
Lee, Guemin	2
More ▼

Journal Articles	97
Reports - Research	96
Reports - Evaluative	67
Speeches/Meeting Papers	42
Reports - Descriptive	14
Dissertations/Theses -…	5
Books	4
Collected Works - General	4
Tests/Questionnaires	4
Information Analyses	3
Numerical/Quantitative Data	3
Guides - Non-Classroom	2
Collected Works - Proceedings	1
Guides - Classroom - Teacher	1
Guides - General	1
Non-Print Media	1
Opinion Papers	1
Reference Materials - General	1
Reports - General	1
More ▼