ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	19

Descriptor

Difficulty Level	41
Educational Testing	41
Test Items	16
Higher Education	9
Test Construction	9
Test Interpretation	9
Achievement Tests	8
Educational Assessment	8
Item Analysis	8
Multiple Choice Tests	8
Elementary Secondary Education	7
Item Response Theory	7
Statistical Analysis	7
Computer Assisted Testing	6
Reading Tests	6
Standardized Tests	6
Test Reliability	6
Test Bias	5
Test Results	5
Testing Programs	5
Academic Achievement	4
Comparative Analysis	4
Correlation	4
Elementary Education	4
Foreign Countries	4
More ▼

Publication Type

Reports - Research	21
Journal Articles	17
Reports - Evaluative	6
Dissertations/Theses -…	5
Information Analyses	5
Opinion Papers	3
Guides - Non-Classroom	2
Speeches/Meeting Papers	2
Tests/Questionnaires	2
ERIC Publications	1
Non-Print Media	1
Numerical/Quantitative Data	1
Reference Materials - General	1
Reports - Descriptive	1
More ▼

Education Level

Elementary Secondary Education	6
Higher Education	5
Elementary Education	4
Postsecondary Education	4
Grade 4	2
Grade 6	2
High Schools	2
Secondary Education	2
Grade 12	1
Grade 5	1
Grade 8	1
Intermediate Grades	1
Middle Schools	1
More ▼

Audience

Location

Taiwan	2
Arizona	1
Australia	1
Ghana	1
New York	1
North Carolina	1
Pennsylvania	1
Tennessee	1
United Kingdom	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	2
Elementary and Secondary…	1

Assessments and Surveys

National Assessment of…	3
Program for International…	2
SAT (College Admission Test)	2
Graduate Record Examinations	1

What Works Clearinghouse Rating

Showing 1 to 15 of 41 results Save | Export

Somers' D as an Alternative for the Item-Test and Item-Rest Correlation Coefficients in the Educational Measurement Settings

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – International Journal of Educational Methodology, 2020

Pearson product-moment correlation coefficient between item g and test score X, known as item-test or item-total correlation ("Rit"), and item-rest correlation ("Rir") are two of the most used classical estimators for item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…

Descriptors: Correlation, Test Items, Scores, Difficulty Level

On the Issue of Item Selection in Computerized Adaptive Testing with Response Times

Peer reviewed

Direct link

Veldkamp, Bernard P. – Journal of Educational Measurement, 2016

Many standardized tests are now administered via computer rather than paper-and-pencil format. The computer-based delivery mode brings with it certain advantages. One advantage is the ability to adapt the difficulty level of the test to the ability level of the test taker in what has been termed computerized adaptive testing (CAT). A second…

Descriptors: Computer Assisted Testing, Reaction Time, Standardized Tests, Difficulty Level

The Interaction of Ability Differences and Guessing When Modeling Differential Item Functioning with the Rasch Model: Conventional and Tailored Calibration

Peer reviewed

Direct link

DeMars, Christine E.; Jurich, Daniel P. – Educational and Psychological Measurement, 2015

In educational testing, differential item functioning (DIF) statistics must be accurately estimated to ensure the appropriate items are flagged for inspection or removal. This study showed how using the Rasch model to estimate DIF may introduce considerable bias in the results when there are large group differences in ability (impact) and the data…

Descriptors: Test Bias, Guessing (Tests), Ability, Differences

Using Reliability and Item Analysis to Evaluate a Teacher-Developed Test in Educational Measurement and Evaluation

Peer reviewed

Direct link

Quaigrain, Kennedy; Arhin, Ato Kwamina – Cogent Education, 2017

Item analysis is essential in improving items which will be used again in later tests; it can also be used to eliminate misleading items in a test. The study focused on item and test quality and explored the relationship between difficulty index (p-value) and discrimination index (DI) with distractor efficiency (DE). The study was conducted among…

Descriptors: Item Analysis, Teacher Developed Materials, Test Reliability, Educational Assessment

An Investigation of the Efficacy of Criterion Refinement Procedures in Mantel-Haenszel DIF Analysis. Research Report. ETS RR-13-16

Peer reviewed
PDF on ERIC

Download full text

Zwick, Rebecca; Ye, Lei; Isham, Steven – ETS Research Report Series, 2013

Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. Although it is often assumed that refinement of the matching criterion always provides more accurate DIF results, the actual situation proves to be more complex. To explore the effectiveness of refinement, we…

Descriptors: Test Bias, Statistical Analysis, Simulation, Educational Testing

Investigating the Relationships between a Reading Test and Can-Do Statements of Performance on Reading Tasks

Direct link

Liu, Hsin-min – ProQuest LLC, 2014

One of the fundamental problems in language testing is the lack of adequate generalizability between what a test is measuring and what fulfills the learners' real world language use needs. It is important to recognize that no matter how precise a test measures a construct, if the way that a construct is defined and the way that test tasks are…

Descriptors: Reading Tests, Language Tests, Task Analysis, Generalizability Theory

Evaluating the Bookmark Standard Setting Method: The Impact of Random Item Ordering

Peer reviewed

Direct link

Davis-Becker, Susan L.; Buckendahl, Chad W.; Gerrow, Jack – International Journal of Testing, 2011

Throughout the world, cut scores are an important aspect of a high-stakes testing program because they are a key operational component of the interpretation of test scores. One method for setting standards that is prevalent in educational testing programs--the Bookmark method--is intended to be a less cognitively complex alternative to methods…

Descriptors: Standard Setting (Scoring), Cutting Scores, Educational Testing, Licensing Examinations (Professions)

College and Career Readiness: An Initial Validation Argument

Download full text

Camara, Wayne – College Board, 2011

This presentation was presented at the 2011 National Conference on Student Assessment (CCSSO). The focus of this presentation is how to validate the common core state standards (CCSS) in math and ELA and the subsequent assessments that will be developed by state consortia. The CCSS specify the skills students need to be ready for post-secondary…

Descriptors: College Readiness, Career Readiness, Benchmarking, Student Evaluation

The Advanced Placement Program in Pennsylvania: Implications for Policy and Practice in K-12 and Higher Education

Direct link

Liekar, Christine Y. – ProQuest LLC, 2012

Since the time of Sputnik, American educators and policymakers have recognized the need to raise expectations by increasing rigor in high schools across the United States. Copious studies attest to the fact that students who take Advanced Placement coursework experience success in college (Adelman, 1999; Camara, 2003; College Board, 2005;…

Descriptors: High School Students, Advanced Placement Programs, Educational Policy, Educational Practices

Toward Educational Testing Reform: Inside Reading Achievement Tests

Peer reviewed
PDF on ERIC

Download full text

Schutz, Dick – Education Policy Analysis Archives, 2013

The commentary (1) uses the U. S. National Assessment of Educational Progress (NAEP) as a prototype for examining standardized reading achievement tests at the item level, and (2) sketches an alternative based on an initiative underway in the United Kingdom.

Descriptors: Educational Testing, Educational Change, Achievement Tests, Reading Achievement

Determination of a Predictive Model for the Fundamentals of Engineering Examination

Direct link

Wheeler, Edward W. – ProQuest LLC, 2012

In early 1995, the University of Tennessee at Martin (UTM) sought permission to terminate three existing engineering technology degree programs and replace them with a single Bachelor of Science in Engineering (BSE) degree. As part of the requirements to proceed with the implementation of an engineering program, the University of Tennessee system…

Descriptors: Engineering, Engineering Education, Models, Prediction

Fixing the c Parameter in the Three-Parameter Logistic Model

Peer reviewed
PDF on ERIC

Download full text

Han, Kyung T. – Practical Assessment, Research & Evaluation, 2012

For several decades, the "three-parameter logistic model" (3PLM) has been the dominant choice for practitioners in the field of educational measurement for modeling examinees' response data from multiple-choice (MC) items. Past studies, however, have pointed out that the c-parameter of 3PLM should not be interpreted as a guessing…

Descriptors: Statistical Analysis, Models, Multiple Choice Tests, Guessing (Tests)

An Enhanced Learning Diagnosis Model Based on Concept-Effect Relationships with Multiple Knowledge Levels

Peer reviewed

Direct link

Chu, Hui-Chun; Hwang, Gwo-Jen; Huang, Yueh-Min – Innovations in Education and Teaching International, 2010

Conventional testing systems usually give students a score as their test result, but do not show them how to improve their learning performance. Researchers have indicated that students would benefit more if individual learning guidance could be provided. However, most of the existing learning diagnosis models ignore the fact that one concept…

Descriptors: Test Results, Teaching Methods, Elementary School Students, Elementary School Teachers

A Case Study to Explore Rigorous Teaching and Testing Practices to Narrow the Achievement Gap

Direct link

Isler, Tesha – ProQuest LLC, 2012

The problem examined in this study: Does the majority of teachers use rigorous teaching and testing practices? The purpose of this qualitative exploratory case study was to explore the classroom techniques of six effective teachers who use rigorous teaching and testing practices. The hypothesis for this study is that the examination of the…

Descriptors: Difficulty Level, Teaching Methods, Educational Testing, Achievement Gap

Implementation of an Improved Adaptive Testing Theory

Peer reviewed

Direct link

Al-A'ali, Mansoor – Educational Technology & Society, 2007

Computer adaptive testing is the study of scoring tests and questions based on assumptions concerning the mathematical relationship between examinees' ability and the examinees' responses. Adaptive student tests, which are based on item response theory (IRT), have many advantages over conventional tests. We use the least square method, a…

Descriptors: Educational Testing, Higher Education, Elementary Secondary Education, Student Evaluation

Previous Page | Next Page »

Pages: 1 | 2 | 3

ProQuest LLC	5
Alberta Journal of…	1
Applied Psychological…	1
Clearing House	1
Cogent Education	1
College Board	1
Contemporary Educational…	1
ETS Research Report Series	1
Education Policy Analysis…	1
Educational Technology &…	1
Educational and Psychological…	1
Innovations in Education and…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Educational…	1
Journal of Educational…	1
Journal of Educational…	1
Journal of Experimental…	1
Ministerial Council on…	1
Practical Assessment,…	1
Psychometrika	1
More ▼

Cahen, Leonard S.	3
Al-A'ali, Mansoor	1
Arhin, Ato Kwamina	1
Barden, Tiffannie M.	1
Blasius, Jorg	1
Buckendahl, Chad W.	1
Camara, Wayne	1
Chen, Deng-Jyi	1
Chen, Shu-Ling	1
Chu, Hui-Chun	1
Davis-Becker, Susan L.	1
DeMars, Christine E.	1
Donovan, Jenny	1
Flaherty, Etienne	1
Gerrow, Jack	1
Gilmer, Jerry S.	1
Han, Kyung T.	1
Harvey, Anne L.	1
Huang, Yueh-Min	1
Hunter-Blanks, Patricia	1
Hutton, Penny	1
Hwang, Gwo-Jen	1
Isham, Steven	1
Isler, Tesha	1
More ▼