Showing 1 to 15 of 24 results
Rachel Lee – ProQuest LLC, 2024
Classical item analysis (CIA) entails summarizing items based on two key attributes: item difficulty and item discrimination, defined respectively as the proportion of examinees answering correctly and the difference in correctness between high and low scorers. Recent insights reveal a direct link between these measures and aspects of signal detection theory…
Descriptors: Item Analysis, Knowledge Level, Difficulty Level, Measurement
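The difficulty and discrimination indices described in the abstract above lend themselves to a short computational illustration. The Python sketch below computes both quantities from a 0/1 response matrix; the 27% upper/lower group split, the data, and the function names are illustrative assumptions, not taken from the cited dissertation.

# Classical item analysis: difficulty (proportion correct) and
# upper-minus-lower discrimination, as summarized in the abstract above.
import numpy as np

def item_difficulty(responses: np.ndarray) -> np.ndarray:
    """Proportion correct per item; responses is examinees x items, coded 0/1."""
    return responses.mean(axis=0)

def item_discrimination(responses: np.ndarray, group_frac: float = 0.27) -> np.ndarray:
    """Difference in proportion correct between high- and low-scoring groups,
    with groups formed from the top and bottom group_frac of total scores."""
    totals = responses.sum(axis=1)
    order = np.argsort(totals)                  # ascending by total score
    n_group = max(1, int(round(group_frac * len(totals))))
    low = responses[order[:n_group]]            # lowest scorers
    high = responses[order[-n_group:]]          # highest scorers
    return high.mean(axis=0) - low.mean(axis=0)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    data = (rng.random((200, 10)) > 0.4).astype(int)  # fake 0/1 response matrix
    print("difficulty:", item_difficulty(data).round(2))
    print("discrimination:", item_discrimination(data).round(2))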
Peer reviewed
Stephen Humphry; Paul Montuoro; Carolyn Maxwell – Journal of Psychoeducational Assessment, 2024
This article builds upon a prominent definition of construct validity that focuses on variation in attributes causing variation in measurement outcomes. It synthesizes this definition and uses Rasch measurement modeling to explicate a modified conceptualization of construct validity for assessments of developmental attributes. If…
Descriptors: Construct Validity, Measurement Techniques, Developmental Stages, Item Analysis
Peer reviewed
Trace, Jonathan; Brown, James Dean; Janssen, Gerriet; Kozhevnikova, Liudmila – Language Testing, 2017
Cloze tests have been the subject of numerous studies regarding their function and use in both first language and second language contexts (e.g., Jonz & Oller, 1994; Watanabe & Koyama, 2008). From a validity standpoint, one area of investigation has been the extent to which cloze tests measure reading ability beyond the sentence level.…
Descriptors: Cloze Procedure, Language Tests, Test Items, Item Analysis
Peer reviewed
Stancliffe, R. J.; Wilson, N. J.; Bigby, C.; Balandin, S.; Craig, D. – Journal of Intellectual Disability Research, 2014
Background: We compared responsiveness to two self-report assessments of loneliness: the "UCLA Loneliness Scale" (UCLALS) designed for the general community, and the "Modified Worker Loneliness Questionnaire" (MWLQ) designed for people with intellectual disability (ID). Methods: Participants were 56 older adults with…
Descriptors: Measurement Techniques, Psychological Patterns, Measures (Individuals), Older Adults
Peer reviewed
Raker, Jeffrey R.; Trate, Jaclyn M.; Holme, Thomas A.; Murphy, Kristen – Journal of Chemical Education, 2013
Experts use their domain expertise and knowledge of examinees' ability levels as they write test items. The expert test writer can then estimate the difficulty of the test items subjectively. However, an objective method for assigning difficulty to a test item would capture the cognitive demands imposed on the examinee as well as be…
Descriptors: Organic Chemistry, Test Items, Item Analysis, Difficulty Level
Peer reviewed
Zeuch, Nina; Holling, Heinz; Kuhn, Jorg-Tobias – Learning and Individual Differences, 2011
The Latin Square Task (LST) was developed by Birney, Halford, and Andrews [Birney, D. P., Halford, G. S., & Andrews, G. (2006). Measuring the influence of cognitive complexity on relational reasoning: The development of the Latin Square Task. Educational and Psychological Measurement, 66, 146-171.] and represents a non-domain specific,…
Descriptors: Difficulty Level, Geometric Concepts, Item Response Theory, Item Analysis
Peer reviewed
Laprise, Shari L. – College Teaching, 2012
Successful exam composition can be a difficult task. Exams should not only assess student comprehension, but be learning tools in and of themselves. In a biotechnology course delivered to nonmajors at a business college, objective multiple-choice test questions often require students to choose the exception or "not true" choice. Anecdotal student…
Descriptors: Feedback (Response), Test Items, Multiple Choice Tests, Biotechnology
Peer reviewed
Kingsbury, G. Gage; Wise, Steven L. – Journal of Applied Testing Technology, 2011
Development of adaptive tests used in K-12 settings requires the creation of stable measurement scales to measure the growth of individual students from one grade to the next, and to measure change in groups from one year to the next. Accountability systems like No Child Left Behind require stable measurement scales so that accountability has…
Descriptors: Elementary Secondary Education, Adaptive Testing, Academic Achievement, Measures (Individuals)
Frisbie, David A. – 1980
The development of a new technique, the Relative Difficulty Ratio (RDR), is described, as well as how it can be used to determine the difficulty level of a test so that meaningful inter-test difficulty comparisons can be made. Assumptions made in computing RDR include: 1) each item must be scored dichotomously with only one answer choice keyed as…
Descriptors: Difficulty Level, Item Analysis, Measurement Techniques, Scores
Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2009
In this technical report, we describe the development and piloting of a series of mathematics progress monitoring measures intended for use with students in grade 1. These measures, available as part of easyCBM [TM], an online progress monitoring assessment system, were developed in 2008 and administered to approximately 2800 students from schools…
Descriptors: Academic Achievement, Research Reports, Grade 1, Outcome Measures
Smith, Donald M. – 1974
The concept of scaled achievement tests is discussed and a method of selecting those items of a test that form the most scalable (i.e., having the highest coefficient of reproducibility) subset is presented. Sometimes called a monotonic-deterministic model, this type of test assumes that the test items may be sequentially ordered. To determine the…
Descriptors: Achievement Tests, Arithmetic, Difficulty Level, Item Analysis
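The "most scalable subset" criterion in the Smith (1974) abstract above refers to Guttman-type scaling: items are ordered by difficulty, a respondent who passes a harder item is expected to pass all easier ones, and scalability is summarized by the coefficient of reproducibility, CR = 1 - errors / (number of respondents x number of items). The Python sketch below is an illustrative implementation of that coefficient using one common deviation-counting convention; it is not the item-selection procedure of the report itself.

# Coefficient of reproducibility for a Guttman-type (monotonic-deterministic)
# scale, counting deviations from the ideal pattern implied by total score.
import numpy as np

def coefficient_of_reproducibility(responses: np.ndarray) -> float:
    """responses: respondents x items matrix of 0/1 scores."""
    # Order items from easiest (most often passed) to hardest.
    order = np.argsort(-responses.mean(axis=0))
    ordered = responses[:, order]
    totals = ordered.sum(axis=1)
    # Ideal Guttman pattern: each respondent passes exactly their `total`
    # easiest items; count each deviation from that pattern as one error.
    ideal = (np.arange(ordered.shape[1]) < totals[:, None]).astype(int)
    errors = np.abs(ordered - ideal).sum()
    return 1.0 - errors / ordered.size

if __name__ == "__main__":
    data = np.array([[1, 1, 1, 0],
                     [1, 1, 0, 0],
                     [1, 0, 1, 0],   # deviates from a perfect scalogram
                     [1, 0, 0, 0]])
    print(round(coefficient_of_reproducibility(data), 3))  # 0.875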
Peer reviewed
Lord, Frederic M. – Educational and Psychological Measurement, 1971
A number of empirical studies are suggested to answer certain questions in connection with flexilevel tests. (MS)
Descriptors: Comparative Analysis, Difficulty Level, Guessing (Tests), Item Analysis
Samejima, Fumiko – 1980
Many combinations of a method and an approach for estimating the operating characteristics of graded item responses, without assuming any mathematical forms, have been produced. These methods use a set of items whose characteristics are known, or Old Test, which has a large, constant amount of test information throughout the interval…
Descriptors: Difficulty Level, Item Analysis, Latent Trait Theory, Least Squares Statistics
Peer reviewed
Lord, Frederic M. – Educational and Psychological Measurement, 1971
Descriptors: Ability, Adaptive Testing, Computer Oriented Programs, Difficulty Level
Lord, Frederic M. – 1971
A flexilevel test is found to be inferior to a peaked conventional test for measuring examinees in the middle of the ability range, but superior for examinees at the extremes. Throughout the entire range of ability, a flexilevel test is much superior to any conventional test that attempts to provide accurate measurement at both extremes. See also ED…
Descriptors: Ability, Comparative Analysis, Difficulty Level, Guessing (Tests)