Showing all 12 results
Peer reviewed
Direct link
DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023
A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness
Peer reviewed
PDF on ERIC
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick – ETS Research Report Series, 2018
For a multiple-choice test under development or redesign, it is important to choose the optimal number of options per item so that the test possesses the desired psychometric properties. On the basis of available data for a multiple-choice assessment with 8 options, we evaluated the effects of changing the number of options on test properties…
Descriptors: Multiple Choice Tests, Test Items, Simulation, Test Construction
Peer reviewed
PDF on ERIC
Zhang, Zhidong – International Education Studies, 2018
This study explored a diagnostic assessment method that emphasized the cognitive process of algebra learning. The study utilized a design and a theory-driven model to examine the content knowledge. Using the theory-driven model, the thinking skills involved in algebra learning were also examined. A Bayesian network model was applied to represent the theory…
Descriptors: Algebra, Bayesian Statistics, Scores, Mathematics Achievement
Peer reviewed
PDF on ERIC
Kogar, Hakan – International Journal of Assessment Tools in Education, 2018
The aim of this simulation study was to determine the relationship between true latent scores and estimated latent scores by including various control variables and different statistical models. The study also aimed to compare the statistical models and determine the effects of different distribution types, response formats and sample sizes on latent…
Descriptors: Simulation, Context Effect, Computation, Statistical Analysis
Peer reviewed
PDF on ERIC
Steinberg, Jonathan; Andrews-Todd, Jessica; Forsyth, Carolyn; Chamberlain, John; Horwitz, Paul; Koon, Al; Rupp, Andre; McCulla, Laura – ETS Research Report Series, 2020
This study discusses the development of a basic electronics knowledge (BEK) assessment as a pretest activity for undergraduate students in engineering and related fields. The 28 BEK items represent 12 key concepts, including properties of serial circuits, knowledge of electrical laws (e.g., Kirchhoff's and Ohm's laws), and properties of digital…
Descriptors: Knowledge Level, Skill Development, Psychometrics, Student Evaluation
Peer reviewed
PDF on ERIC
Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2014
The purpose of this study was to investigate the potential impact of misrouting under a 2-stage multistage test (MST) design, which includes 1 routing and 3 second-stage modules. Simulations were used to create a situation in which a large group of examinees took each of the 3 possible MST paths (high, middle, and low). We compared differences in…
Descriptors: Comparative Analysis, Difficulty Level, Scores, Test Wiseness
Peer reviewed
PDF on ERIC
Efendioglu, Akin – Educational Research and Reviews, 2016
The aim of this study is to investigate pre-service teachers' cognitive load types (intrinsic load-IL, extraneous load-EL, and germane load-GL), academic achievements, and affective characteristics (attitude and self-efficacy) at two stages of experimental learning processes. The first and the second groups used explanatory instructional…
Descriptors: Foreign Countries, Preservice Teachers, Cognitive Processes, Difficulty Level
Sunnassee, Devdass – ProQuest LLC, 2011
Small sample equating remains a largely unexplored area of research. This study attempts to fill in some of the research gaps via a large-scale, IRT-based simulation study that evaluates the performance of seven small-sample equating methods under various test characteristic and sampling conditions. The equating methods considered are typically…
Descriptors: Test Length, Test Format, Sample Size, Simulation
Peer reviewed
Direct link
Atar, Burcu; Kamata, Akihito – Hacettepe University Journal of Education, 2011
The Type I error rates and the power of IRT likelihood ratio test and cumulative logit ordinal logistic regression procedures in detecting differential item functioning (DIF) for polytomously scored items were investigated in this Monte Carlo simulation study. For this purpose, 54 simulation conditions (combinations of 3 sample sizes, 2 sample…
Descriptors: Test Bias, Sample Size, Monte Carlo Methods, Item Response Theory
Peer reviewed
Direct link
Yip, Chi Kwong; Man, David W. K. – International Journal of Rehabilitation Research, 2009
This study investigates the validity of a newly developed computerized cognitive assessment system (CCAS) that is equipped with rich multimedia to generate simulated testing situations and considers both test item difficulty and the test taker's ability. It is also hypothesized that better predictive validity of the CCAS in self-care of persons…
Descriptors: Test Items, Content Validity, Predictive Validity, Patients
Peer reviewed
Meijer, Rob R.; And Others – Applied Psychological Measurement, 1994
The power of the nonparametric person-fit statistic, U3, is investigated through simulations as a function of item characteristics, test characteristics, person characteristics, and the group to which examinees belong. Results suggest conditions under which relatively short tests can be used for person-fit analysis. (SLD)
Descriptors: Difficulty Level, Group Membership, Item Response Theory, Nonparametric Statistics
Spray, Judith A.; Welch, Catherine J. – 1986
The purpose of this study was to examine the effect that large within-examinee item difficulty variability had on estimates of the proportion of consistent classification of examinees into mastery categories over two test administrations. The classification consistency estimate was based on a single test administration from an estimation procedure…
Descriptors: Adults, Difficulty Level, Estimation (Mathematics), Mathematical Models