Publication Date
  In 2025: 0
  Since 2024: 2
  Since 2021 (last 5 years): 16
  Since 2016 (last 10 years): 32
  Since 2006 (last 20 years): 72
Descriptor
  Licensing Examinations (Professions): 153
  Test Items: 153
  Test Construction: 49
  Difficulty Level: 34
  Item Response Theory: 33
  Multiple Choice Tests: 31
  Higher Education: 27
  Certification: 24
  Scores: 24
  Cutting Scores: 23
  Standard Setting (Scoring): 22
Publication Type
  Reports - Research: 95
  Journal Articles: 79
  Speeches/Meeting Papers: 52
  Reports - Evaluative: 44
  Reports - Descriptive: 8
  Dissertations/Theses -…: 4
  Tests/Questionnaires: 2
  Guides - General: 1
Location
  Canada: 5
  Tennessee: 4
  Saudi Arabia: 3
  Arizona: 2
  California: 2
  United States: 2
  Australia: 1
  China: 1
  Colorado: 1
  Florida: 1
  Georgia: 1
Laws, Policies, & Programs
  Comprehensive Education…: 2
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2022
Administrative problems such as computer malfunctions and power outages occasionally lead to missing item scores, and hence to incomplete data, on credentialing tests such as the United States Medical Licensing Examination. Feinberg compared four approaches for reporting pass-fail decisions to the examinees with incomplete data on credentialing…
Descriptors: Testing Problems, High Stakes Tests, Credentials, Test Items
Emily K. Toutkoushian; Huaping Sun; Mark T. Keegan; Ann E. Harman – Measurement: Interdisciplinary Research and Perspectives, 2024
Linear logistic test models (LLTMs), leveraging item response theory and linear regression, offer an elegant method for learning about item characteristics in complex content areas. This study used LLTMs to model single-best-answer, multiple-choice-question response data from two medical subspecialty certification examinations in multiple years…
Descriptors: Licensing Examinations (Professions), Certification, Medical Students, Test Items
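An LLTM explains Rasch item difficulties as a weighted sum of item-feature effects. As a rough illustration (not the authors' estimation code; a full LLTM fits the feature weights inside the IRT likelihood), here is a two-stage least-squares approximation with made-up difficulties and a made-up feature matrix:

```python
# A minimal two-stage approximation of an LLTM-style analysis.
# Item difficulties and the Q matrix are invented example values.
import numpy as np

# Q[i, k] = 1 if item i involves feature k (e.g., content area, cognitive level)
Q = np.array([
    [1, 0, 1],
    [1, 1, 0],
    [0, 1, 1],
    [1, 0, 0],
    [0, 1, 0],
])
b = np.array([0.8, 1.1, 0.4, 0.3, 0.7])  # estimated Rasch item difficulties

# Least-squares estimate of feature weights eta, so that b ≈ Q @ eta
eta, *_ = np.linalg.lstsq(Q, b, rcond=None)
print("feature weights:", eta.round(3))
print("reconstructed difficulties:", (Q @ eta).round(3))
```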
Alahmadi, Sarah; Jones, Andrew T.; Barry, Carol L.; Ibáñez, Beatriz – Applied Measurement in Education, 2023
Rasch common-item equating is often used in high-stakes testing to maintain equivalent passing standards across test administrations. If unaddressed, item parameter drift poses a major threat to the accuracy of Rasch common-item equating. We compared the performance of well-established and newly developed drift detection methods in small and large…
Descriptors: Equated Scores, Item Response Theory, Sample Size, Test Items
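One widely used screen for the drift problem the abstract describes is to re-estimate the equating constant after dropping anchor items whose residual displacement exceeds a cutoff. The sketch below is a minimal version of that idea; the 0.5-logit threshold and the toy difficulty values are illustrative choices, not the paper's methods:

```python
# Iterative displacement screen for item parameter drift in
# Rasch common-item equating (toy values, conventional 0.5-logit rule).
import numpy as np

b_old = np.array([-1.2, -0.4, 0.0, 0.5, 1.1, 1.8])   # bank difficulties
b_new = np.array([-1.1, -0.5, 0.9, 0.6, 1.2, 1.9])   # new-form estimates

def flag_drift(b_old, b_new, threshold=0.5, max_iter=10):
    keep = np.ones(len(b_old), dtype=bool)
    shift = 0.0
    for _ in range(max_iter):
        shift = (b_new[keep] - b_old[keep]).mean()   # equating constant
        disp = np.abs((b_new - b_old) - shift)       # residual displacement
        new_keep = keep & (disp <= threshold)
        if np.array_equal(new_keep, keep):
            break
        keep = new_keep                              # drop drifting anchors, refit
    return keep, shift

keep, shift = flag_drift(b_old, b_new)
print("equating constant:", round(shift, 3))
print("stable anchor items:", np.where(keep)[0])     # item 2 is flagged as drifting
```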
Sinharay, Sandip – Grantee Submission, 2021
Drasgow, Levine, and Zickar (1996) suggested a statistic based on the Neyman-Pearson lemma (e.g., Lehmann & Romano, 2005, p. 60) for detecting preknowledge on a known set of items. The statistic is a special case of the optimal appropriateness indices of Levine and Drasgow (1988) and is the most powerful statistic for detecting item…
Descriptors: Robustness (Statistics), Hypothesis Testing, Statistics, Test Items
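The statistic in question is a log-likelihood ratio: the likelihood of the responses on the suspected items under a preknowledge model versus under the fitted IRT model. A minimal sketch, assuming a Rasch model and a fixed success probability of 0.95 under preknowledge (both illustrative simplifications):

```python
# Likelihood-ratio (Neyman-Pearson) check for preknowledge on a known
# compromised item subset. All values are illustrative, not from the paper.
import numpy as np

def rasch_p(theta, b):
    return 1.0 / (1.0 + np.exp(-(theta - b)))

def np_lr_statistic(x, theta, b, compromised, p_know=0.95):
    """Log-likelihood ratio on the compromised items only.

    x: 0/1 responses, b: item difficulties, compromised: boolean mask.
    Large positive values favor the preknowledge hypothesis.
    """
    p_null = rasch_p(theta, b[compromised])          # honest responding
    p_alt = np.full(p_null.shape, p_know)            # preknowledge model
    xc = x[compromised]
    ll_null = np.sum(xc * np.log(p_null) + (1 - xc) * np.log(1 - p_null))
    ll_alt = np.sum(xc * np.log(p_alt) + (1 - xc) * np.log(1 - p_alt))
    return ll_alt - ll_null

b = np.array([-0.5, 0.2, 0.9, 1.5, 2.0])
x = np.array([1, 1, 1, 1, 1])                        # all correct
mask = np.array([False, False, True, True, True])    # suspected items
print(round(np_lr_statistic(x, theta=-0.5, b=b, compromised=mask), 3))
```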
Sakworawich, Arnond; Wainer, Howard – Journal of Educational and Behavioral Statistics, 2020
Test scoring models vary in their generality; some even adjust for examinees answering multiple-choice items correctly by accident (guessing), but no models that we are aware of automatically adjust an examinee's score when there is internal evidence of cheating. In this study, we use a combination of jackknife technology with an adaptive robust…
Descriptors: Scoring, Cheating, Test Items, Licensing Examinations (Professions)
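As a rough illustration of the jackknife ingredient (only that ingredient; the authors' adaptive robust estimator is not reproduced here), one can re-estimate ability with each item deleted in turn and inspect how strongly single items pull the estimate:

```python
# Leave-one-item-out (jackknife) ability estimates under a Rasch model.
# Responses and difficulties are invented; the last item is an unlikely success.
import numpy as np
from scipy.optimize import minimize_scalar

def rasch_nll(theta, x, b):
    p = 1.0 / (1.0 + np.exp(-(theta - b)))
    return -np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))

def mle_theta(x, b):
    return minimize_scalar(rasch_nll, args=(x, b), bounds=(-6, 6),
                           method="bounded").x

b = np.array([-1.5, -0.5, 0.0, 0.5, 1.0, 1.5, 2.5])
x = np.array([1, 1, 1, 0, 0, 0, 1])   # note the unlikely success on the last item

theta_full = mle_theta(x, b)
jack = np.array([mle_theta(np.delete(x, i), np.delete(b, i))
                 for i in range(len(b))])
print("full-data theta:", round(theta_full, 2))
print("leave-one-out thetas:", jack.round(2))  # the last item moves the estimate most
```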
Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022
As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…
Descriptors: Scores, Scoring, Comparative Analysis, Testing
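The abstract does not enumerate the procedures, but as a placeholder, the most basic comparability check is a standardized mean difference between delivery modes; operational analyses would also condition on examinee covariates before attributing any gap to mode:

```python
# Standardized mean difference between remote and test-center scores
# on simulated data (illustrative only).
import numpy as np

rng = np.random.default_rng(0)
center = rng.normal(500, 100, size=400)   # simulated test-center scores
remote = rng.normal(505, 100, size=400)   # simulated at-home scores

pooled_sd = np.sqrt((center.var(ddof=1) + remote.var(ddof=1)) / 2)
d = (remote.mean() - center.mean()) / pooled_sd
print(f"standardized mean difference: {d:.3f}")
```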
Jang, Jung Un; Kim, Eun Joo – Journal of Curriculum and Teaching, 2022
This study examines the validity of pen-and-paper and smart-device-based versions of an optician's examination. The questions developed for each medium were based on the national optician's simulation test. The subjects of this study were 60 students enrolled at E University. The data analysis was performed to verify the equivalence of the two…
Descriptors: Optometry, Licensing Examinations (Professions), Test Format, Test Validity
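One simple way to check score equivalence across two test media, sketched here on simulated data for 60 examinees (the study's actual analysis may differ), is a paired comparison of the same students' scores on both versions:

```python
# Paired comparison of scores on two test media (simulated data).
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
true_ability = rng.normal(70, 8, size=60)            # 60 students, as in the study
paper = true_ability + rng.normal(0, 3, size=60)     # pen-and-paper scores
device = true_ability + rng.normal(0, 3, size=60)    # smart-device scores

t, p = stats.ttest_rel(paper, device)
print(f"paired t = {t:.2f}, p = {p:.3f}")            # no built-in mode effect here
```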
Betts, Joe; Muntean, William; Kim, Doyoung; Kao, Shu-chuan – Educational and Psychological Measurement, 2022
The multiple response structure can underlie several different technology-enhanced item types. With the increased use of computer-based testing, multiple response items are becoming more common. This response type holds the potential for being scored polytomously for partial credit. However, there are several possible methods for computing raw…
Descriptors: Scoring, Test Items, Test Format, Raw Scores
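Three generic raw-score rules for a multiple-response item, named here informally rather than after the paper's taxonomy, can be sketched as follows:

```python
# Three common raw-score rules for multiple-response items:
# all-or-nothing, per-option partial credit, and plus/minus with a floor at zero.
def score_all_or_nothing(selected, key):
    return 1.0 if selected == key else 0.0

def score_partial(selected, key, n_options):
    # credit for each option handled correctly (selected iff keyed)
    agree = sum((opt in selected) == (opt in key) for opt in range(n_options))
    return agree / n_options

def score_plus_minus(selected, key):
    raw = len(selected & key) - len(selected - key)
    return max(raw, 0) / len(key)

key = {0, 2, 3}          # keyed options of a 5-option item
resp = {0, 2, 4}         # examinee picked two right, one wrong
print(score_all_or_nothing(resp, key))   # 0.0
print(score_partial(resp, key, 5))       # 0.6
print(score_plus_minus(resp, key))       # (2 - 1) / 3 ≈ 0.333
```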
Peabody, Michael R. – Measurement: Interdisciplinary Research and Perspectives, 2023
Many organizations use some form of automation in the test assembly process, whether fully algorithmic or heuristically constructed. However, one issue with heuristic models is that when the test assembly problem changes, the entire model may need to be re-conceptualized and recoded. In contrast, mixed-integer programming (MIP) is a mathematical…
Descriptors: Programming Languages, Algorithms, Heuristics, Mathematical Models
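A toy MIP assembly model in the spirit of the abstract, written with the open-source PuLP package (an assumption made for illustration; the paper is not tied to any particular solver): select 3 of 6 items to maximize information at the cut score subject to a content constraint.

```python
# Toy mixed-integer programming model for automated test assembly.
from pulp import LpProblem, LpVariable, LpMaximize, lpSum, LpBinary, PULP_CBC_CMD

info = [0.9, 0.7, 0.8, 0.4, 0.6, 0.5]      # item information at the cut score
content = [0, 0, 1, 1, 0, 1]               # content area of each item

prob = LpProblem("test_assembly", LpMaximize)
x = [LpVariable(f"x{i}", cat=LpBinary) for i in range(6)]

prob += lpSum(info[i] * x[i] for i in range(6))                # objective
prob += lpSum(x) == 3                                          # test length
prob += lpSum(x[i] for i in range(6) if content[i] == 1) >= 1  # content coverage

prob.solve(PULP_CBC_CMD(msg=False))
print("selected items:", [i for i in range(6) if x[i].value() == 1])
```

The advantage the abstract points to is visible even at this scale: changing the assembly problem means editing or adding constraint lines, not redesigning a heuristic.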
Wolkowitz, Amanda A.; Foley, Brett; Zurn, Jared – Practical Assessment, Research & Evaluation, 2023
The purpose of this study is to introduce a method for converting scored 4-option multiple-choice (MC) items into scored 3-option MC items without re-pretesting the 3-option MC items. This study describes a six-step process for achieving this goal. Data from a professional credentialing exam were used in this study, and the method was applied to 24…
Descriptors: Multiple Choice Tests, Test Items, Accuracy, Test Format
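The six-step method itself is not spelled out in the abstract, but the core fact any such conversion must handle is that dropping one of four options raises the blind-guessing rate from 1/4 to 1/3, so a 4-option p-value cannot be reused as-is. A crude illustrative adjustment (not the authors' procedure) rescales the above-chance part of the p-value:

```python
# Map a 4-option item p-value to an approximate 3-option p-value,
# assuming the simple chance-rate model of guessing (illustrative only).
def adjust_p_value(p4, c4=0.25, c3=1/3):
    mastery = (p4 - c4) / (1 - c4)       # proportion answering from knowledge
    return c3 + (1 - c3) * mastery

for p4 in (0.40, 0.60, 0.80):
    print(f"p(4-option) = {p4:.2f} -> p(3-option) ≈ {adjust_p_value(p4):.3f}")
```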
Dianne S. McCarthy – Journal of Inquiry and Action in Education, 2023
Teacher certification exams are supposed to assess whether a student is likely to succeed in teaching. What if an exam seems to be inappropriate? This article is an inquiry into the New York State Content Specialty Test for Early Childhood Candidates, particularly the math section. It raises the issue of whether we are asking the right questions and…
Descriptors: Teacher Certification, Licensing Examinations (Professions), Preservice Teachers, Early Childhood Teachers
Feinberg, Richard A. – Educational Measurement: Issues and Practice, 2021
Unforeseen complications during the administration of large-scale testing programs are inevitable and can prevent examinees from accessing all test material. For classification tests in which the primary purpose is to yield a decision, such as a pass/fail result, the current study investigated a model-based standard error approach, Bayesian…
Descriptors: High Stakes Tests, Classification, Decision Making, Bayesian Statistics
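A simplified version of the kind of decision rule at issue: score only the items the examinee reached, attach a model-based standard error, and report pass/fail only when the ability estimate is decisively on one side of the cut. The Rasch model and the z = 1.645 criterion are illustrative choices, not the paper's:

```python
# Pass/fail classification from incomplete response data with a
# model-based standard error (illustrative sketch).
import numpy as np
from scipy.optimize import minimize_scalar

def rasch_nll(theta, x, b):
    p = 1.0 / (1.0 + np.exp(-(theta - b)))
    return -np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))

def classify(x, b, cut, z=1.645):
    answered = ~np.isnan(x)
    xa, ba = x[answered].astype(int), b[answered]
    theta = minimize_scalar(rasch_nll, args=(xa, ba), bounds=(-6, 6),
                            method="bounded").x
    p = 1.0 / (1.0 + np.exp(-(theta - ba)))
    se = 1.0 / np.sqrt(np.sum(p * (1 - p)))      # 1 / sqrt(test information)
    if theta - z * se > cut:
        return theta, se, "pass"
    if theta + z * se < cut:
        return theta, se, "fail"
    return theta, se, "indeterminate: too few items"

rng = np.random.default_rng(2)
b = np.linspace(-2, 2, 40)
x = (rng.random(40) < 0.7).astype(float)
x[25:] = np.nan                                  # outage: last 15 items lost
theta, se, decision = classify(x, b, cut=0.0)
print(f"theta = {theta:.2f}, SE = {se:.2f}, decision: {decision}")
```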
Lee, Chansoon; Qian, Hong – Educational and Psychological Measurement, 2022
Using classical test theory and item response theory, this study applied sequential procedures to a real operational item pool in a variable-length computerized adaptive testing (CAT) to detect items whose security may be compromised. Moreover, this study proposed a hybrid threshold approach to improve the detection power of the sequential…
Descriptors: Computer Assisted Testing, Adaptive Testing, Licensing Examinations (Professions), Item Response Theory
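A bare-bones sequential check in the same spirit: monitor an item's stream of 0/1 responses and raise a flag when a one-sided CUSUM of upward departures from the item's baseline p-value crosses a threshold. The baseline, slack k, and threshold h below are illustrative choices, not the authors' hybrid procedure:

```python
# One-sided CUSUM on an item's response stream to flag possible compromise.
import numpy as np

def cusum_flag(responses, baseline, k=0.05, h=2.0):
    """Returns the index of the first alarm, or -1 if none."""
    s = 0.0
    for t, x in enumerate(responses):
        s = max(0.0, s + (x - baseline - k))
        if s > h:
            return t
    return -1

rng = np.random.default_rng(3)
clean = rng.random(200) < 0.60                  # item behaves at baseline
leaked = rng.random(200) < 0.85                 # item suddenly much easier
responses = np.concatenate([clean, leaked]).astype(float)

print("alarm at response #", cusum_flag(responses, baseline=0.60))
```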
Tang, Xiaodan; Schultz, Matthew – Practical Assessment, Research & Evaluation, 2020
This study examines the potential impact of reusing simulation-based items in a high-stakes standardized assessment on repeat examinees' performance. We examined changes in item scores, ability estimates, score patterns, and response times, and compared the performance of repeat examinees who received repeated items with those who…
Descriptors: Test Items, Repetition, Simulation, Standardized Tests
Babcock, Ben; Siegel, Zachary D. – Practical Assessment, Research & Evaluation, 2022
Research on repeated testing has revealed that retaking the same exam form generally does not advantage or disadvantage failing candidates on selected-response credentialing exams. Feinberg, Raymond, and Haist (2015) found a contributing factor to this phenomenon: people answering items incorrectly on both attempts give the same incorrect…
Descriptors: Multiple Choice Tests, Item Analysis, Test Items, Response Style (Tests)
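The Feinberg, Raymond, and Haist (2015) observation the abstract builds on can be made concrete with a small invented example: among items missed on both attempts, count how often the same wrong option was chosen.

```python
# Agreement of incorrect option choices across two attempts (invented data).
import numpy as np

key    = np.array(list("ABCDABCDAB"))
first  = np.array(list("ACCDBBCDCB"))   # attempt 1 responses
second = np.array(list("ACCDABCDDB"))   # attempt 2 responses

wrong_both = (first != key) & (second != key)
same_wrong = wrong_both & (first == second)

print("items wrong on both attempts:", wrong_both.sum())
print("same incorrect option chosen:", same_wrong.sum())
print("agreement rate:", same_wrong.sum() / wrong_both.sum())
```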