ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	7

Descriptor

Simulation	12
Test Reliability	12
Evaluation Methods	5
Test Bias	4
Test Validity	4
Accuracy	3
Classification	3
Computation	3
Computer Assisted Testing	3
Item Analysis	3
Item Response Theory	3
Models	3
Scoring	3
Test Items	3
Correlation	2
Hypothesis Testing	2
Interrater Reliability	2
Knowledge Level	2
Licensing Examinations…	2
Measurement Techniques	2
Medical Education	2
Multiple Choice Tests	2
Scores	2
Statistical Significance	2
Testing	2
More ▼

Source

Journal of Educational and…	2
Academic Medicine	1
Applied Psychological…	1
Center for Education Data &…	1
European Journal of…	1
IEEE Transactions on Learning…	1
Journal of Consulting and…	1
Journal of Continuing…	1
Measurement:…	1
Psychometrika	1

Publication Type

Reports - Descriptive	12
Journal Articles	10
Speeches/Meeting Papers	1

Education Level

Higher Education	2
Postsecondary Education	2
Adult Education	1
Elementary Secondary Education	1

Audience

Researchers

Location

Russia

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 12 results Save | Export

R Packages for Item Response Theory Analysis: Descriptions and Features

Peer reviewed

Direct link

Choi, Youn-Jeng; Asilkalkan, Abdullah – Measurement: Interdisciplinary Research and Perspectives, 2019

About 45 R packages to analyze data using item response theory (IRT) have been developed over the last decade. This article introduces these 45 R packages with their descriptions and features. It also describes possible advanced IRT models using R packages, as well as dichotomous and polytomous IRT models, and R packages that contain applications…

Descriptors: Item Response Theory, Data Analysis, Computer Software, Test Bias

Testing Methodology in the Student Learning Process

Peer reviewed
PDF on ERIC

Download full text

Gorbunova, Tatiana N. – European Journal of Contemporary Education, 2017

The subject of the research is to build methodologies to evaluate the student knowledge by testing. The author points to the importance of feedback about the mastering level in the learning process. Testing is considered as a tool. The object of the study is to create the test system models for defence practice problems. Special attention is paid…

Descriptors: Testing, Evaluation Methods, Feedback (Response), Simulation

Item Response Theory for Peer Assessment

Peer reviewed

Direct link

Uto, Masaki; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2016

As an assessment method based on a constructivist approach, peer assessment has become popular in recent years. However, in peer assessment, a problem remains that reliability depends on the rater characteristics. For this reason, some item response models that incorporate rater parameters have been proposed. Those models are expected to improve…

Descriptors: Item Response Theory, Peer Evaluation, Bayesian Statistics, Simulation

Screening Test Items for Differential Item Functioning

Peer reviewed

Direct link

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014

A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing

Assessing the "Rothstein Falsification Test": Does It Really Show Teacher Value-Added Models Are Biased? CEDR Working Paper No. 2012 1.3

Direct link

Goldhaber, Dan; Chaplin, Duncan – Center for Education Data & Research, 2012

In a provocative and influential paper, Jesse Rothstein (2010) finds that standard value added models (VAMs) suggest implausible future teacher effects on past student achievement, a finding that obviously cannot be viewed as causal. This is the basis of a falsification test (the Rothstein falsification test) that appears to indicate bias in VAM…

Descriptors: School Effectiveness, Teacher Effectiveness, Achievement Gains, Statistical Bias

Rater Training to Support High-Stakes Simulation-Based Assessments

Peer reviewed

Direct link

Feldman, Moshe; Lazzara, Elizabeth H.; Vanderbilt, Allison A.; DiazGranados, Deborah – Journal of Continuing Education in the Health Professions, 2012

Competency-based assessment and an emphasis on obtaining higher-level outcomes that reflect physicians' ability to demonstrate their skills has created a need for more advanced assessment practices. Simulation-based assessments provide medical education planners with tools to better evaluate the 6 Accreditation Council for Graduate Medical…

Descriptors: Performance Based Assessment, Physicians, Accuracy, High Stakes Tests

Multinomial and Compound Multinomial Error Models for Tests with Complex Item Scoring

Peer reviewed

Direct link

Lee, Won-Chan – Applied Psychological Measurement, 2007

This article introduces a multinomial error model, which models an examinee's test scores obtained over repeated measurements of an assessment that consists of polytomously scored items. A compound multinomial error model is also introduced for situations in which items are stratified according to content categories and/or prespecified numbers of…

Descriptors: Simulation, Error of Measurement, Scoring, Test Items

The Order-Restricted Association Model: Two Estimation Algorithms and Issues in Testing

Peer reviewed

Direct link

Galindo-Garre, Francisca; Vermunt, Jeroen K. – Psychometrika, 2004

This paper presents a row-column (RC) association model in which the estimated row and column scores are forced to be in agreement with a priori specified ordering. Two efficient algorithms for finding the order-restricted maximum likelihood (ML) estimates are proposed and their reliability under different degrees of association is investigated by…

Descriptors: Mathematics, Test Reliability, Computation, Testing

A Sharing Item Response Theory Model for Computerized Adaptive Testing

Peer reviewed

Direct link

Segall, Daniel O. – Journal of Educational and Behavioral Statistics, 2004

A new sharing item response theory (SIRT) model is presented that explicitly models the effects of sharing item content between informants and test takers. This model is used to construct adaptive item selection and scoring rules that provide increased precision and reduced score gains in instances where sharing occurs. The adaptive item selection…

Descriptors: Scoring, Item Analysis, Item Response Theory, Adaptive Testing

Assessing Clinical Significance: Does it Matter which Method we Use?

Peer reviewed

Direct link

Atkins, David C.; Bedics, Jamie D.; Mcglinchey, Joseph B.; Beauchaine, Theodore P. – Journal of Consulting and Clinical Psychology, 2005

Measures of clinical significance are frequently used to evaluate client change during therapy. Several alternatives to the original method devised by N. S. Jacobson, W. C. Follette, & D. Revenstorf (1984) have been proposed, each purporting to increase accuracy. However, researchers have had little systematic guidance in choosing among…

Descriptors: Psychotherapy, Statistical Significance, Outcomes of Treatment, Behavior Change

Status Report on the NBME's Computer-Based Testing.

Peer reviewed

Clyman, Stephen G.; Orr, Nancy A. – Academic Medicine, 1990

The process proposed for the development and use of computer-based testing, including simulation and multiple-choice questions, as part of the National Board of Medical Examiners' certification sequence is outlined. Summary reports of first-phase pilot testing in six medical schools are appended. (MSE)

Descriptors: Computer Assisted Testing, Higher Education, Licensing Examinations (Professions), Medical Education

The Safety Simulator: Scoring, Reliability and Validity of Interactive Videodisc-Based Assessment of Science Teachers.

Download full text

Lomask, Michal S.; And Others – 1993

An experimental Interactive Video Disc (IVD) assessment program, funded partially by the National Science Foundation, was developed to assess science teachers' knowledge of safe management of lab facilities and activities. The IVD program contains two phases: (1) panoramic view of the lab room, including safety equipment and storage of chemicals;…

Descriptors: Evaluation Methods, High Schools, Interactive Video, Junior High School Students

Asilkalkan, Abdullah	1
Atkins, David C.	1
Beauchaine, Theodore P.	1
Bedics, Jamie D.	1
Chaplin, Duncan	1
Choi, Youn-Jeng	1
Clyman, Stephen G.	1
DiazGranados, Deborah	1
Feldman, Moshe	1
Galindo-Garre, Francisca	1
Goldhaber, Dan	1
Gorbunova, Tatiana N.	1
Lazzara, Elizabeth H.	1
Lee, Won-Chan	1
Lomask, Michal S.	1
Longford, Nicholas T.	1
Mcglinchey, Joseph B.	1
Orr, Nancy A.	1
Segall, Daniel O.	1
Ueno, Maomi	1
Uto, Masaki	1
Vanderbilt, Allison A.	1
Vermunt, Jeroen K.	1
More ▼