ERIC - Search Results

Showing all 11 results

Assessment of Item and Test Parameters: Cosine Similarity Approach

Peer reviewed

Chakrabartty, Satyendra Nath – International Journal of Psychology and Educational Studies, 2021

The paper proposes new measures of difficulty and discriminating values for binary items and for tests consisting of such items, and finds their relationships, including estimation of test error variance and thereby test reliability, as per definition using cosine similarities. The measures use the entire data. Difficulty value of test and item is defined…

Descriptors: Test Items, Difficulty Level, Scores, Test Reliability

Classical Item Analysis from a Signal Detection Perspective

Peer reviewed

DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023

A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…

Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness

Effects of Various Simulation Conditions on Latent-Trait Estimates: A Simulation Study

Peer reviewed

Kogar, Hakan – International Journal of Assessment Tools in Education, 2018

The aim of this simulation study was to determine the relationship between true latent scores and estimated latent scores by including various control variables and different statistical models. The study also aimed to compare the statistical models and determine the effects of different distribution types, response formats and sample sizes on latent…

Descriptors: Simulation, Context Effect, Computation, Statistical Analysis

Facilitating the Interpretation of English Language Proficiency Scores: Combining Scale Anchoring and Test Score Mapping Methodologies

Peer reviewed

Powers, Donald; Schedl, Mary; Papageorgiou, Spiros – Language Testing, 2017

The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…

Descriptors: English (Second Language), Second Language Learning, Language Proficiency, Scores

"TechCheck": Development and Validation of an Unplugged Assessment of Computational Thinking in Early Childhood Education

Peer reviewed

Relkin, Emily; de Ruiter, Laura; Bers, Marina Umaschi – Journal of Science Education and Technology, 2020

There is a need for developmentally appropriate Computational Thinking (CT) assessments that can be implemented in early childhood classrooms. We developed a new instrument called "TechCheck" for assessing CT skills in young children that does not require prior knowledge of computer programming. "TechCheck" is based on…

Descriptors: Developmentally Appropriate Practices, Computation, Thinking Skills, Early Childhood Education

Rating Quality Studies Using Rasch Measurement Theory. Research Report 2013-3

Engelhard, George, Jr.; Wind, Stefanie A. – College Board, 2013

The major purpose of this study is to examine the quality of ratings assigned to CR (constructed-response) questions in large-scale assessments from the perspective of Rasch Measurement Theory. Rasch Measurement Theory provides a framework for the examination of rating scale category structure that can yield useful information for interpreting the…

Descriptors: Measurement Techniques, Rating Scales, Test Theory, Scores

Development of the Enzyme-Substrate Interactions Concept Inventory

Peer reviewed

Bretz, Stacey Lowery; Linenberger, Kimberly J. – Biochemistry and Molecular Biology Education, 2012

Enzyme function is central to student understanding of multiple topics within the biochemistry curriculum. In particular, students must understand how enzymes and substrates interact with one another. This manuscript describes the development of a 15-item Enzyme-Substrate Interactions Concept Inventory (ESICI) that measures student understanding…

Descriptors: Biochemistry, Science Education, Science Instruction, Scientific Concepts

A Control Systems Concept Inventory Test Design and Assessment

Peer reviewed

Bristow, M.; Erkorkmaz, K.; Huissoon, J. P.; Jeon, Soo; Owen, W. S.; Waslander, S. L.; Stubley, G. D. – IEEE Transactions on Education, 2012

Any meaningful initiative to improve the teaching and learning in introductory control systems courses needs a clear test of student conceptual understanding to determine the effectiveness of proposed methods and activities. The authors propose a control systems concept inventory. Development of the inventory was collaborative and iterative. The…

Descriptors: Diagnostic Tests, Concept Formation, Undergraduate Students, Engineering Education

Efficiency of Predicting Risk in Word Reading Using Fewer, Easier Letters

Peer reviewed

Petscher, Yaacov; Kim, Young-Suk – Assessment for Effective Intervention, 2011

Letter-name identification has been widely used as part of early screening to identify children who might be at risk for future word reading difficulty. The goal of the present study was to examine whether a reduced set of letters could have diagnostic accuracy similar to that of a full set (i.e., 26 letters) when used as a screen. First, we…

Descriptors: Clinical Diagnosis, Measures (Individuals), Risk, Reading

Instrument Development Procedures for Mathematics Measures. Technical Report Number 08-02

Jung, Eunju; Liu, Kimy; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008

The purpose of this study was to develop general outcome measures (GOM) in mathematics so that teachers could focus their instruction on needed prerequisite skills. We describe in detail the manner in which content-related evidence was established and then present a number of statistical analyses conducted to evaluate the technical adequacy of…

Descriptors: Item Analysis, Test Construction, Test Theory, Mathematics Tests

Evaluation of the Plot Method for Identifying Potentially Biased Test Items.

Hambleton, Ronald K.; Rogers, H. Jane – 1986

This report was designed to respond to two major methodological shortcomings in the item bias literature: (1) misfitting test models; and (2) the use of significance tests. Specifically, the goals of the research were to describe a newly developed method known as the "plot method" for identifying potentially biased test items and to…

Descriptors: Criterion Referenced Tests, Culture Fair Tests, Difficulty Level, Estimation (Mathematics)
