ERIC - Search Results

Showing all 14 results

Classical Item Analysis from a Signal Detection Perspective

Peer reviewed

DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023

A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…

Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness

Facilitating the Interpretation of English Language Proficiency Scores: Combining Scale Anchoring and Test Score Mapping Methodologies

Peer reviewed

Powers, Donald; Schedl, Mary; Papageorgiou, Spiros – Language Testing, 2017

The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…

Descriptors: English (Second Language), Second Language Learning, Language Proficiency, Scores

Gender Fairness within the Force Concept Inventory

Peer reviewed

Traxler, Adrienne; Henderson, Rachel; Stewart, John; Stewart, Gay; Papak, Alexis; Lindell, Rebecca – Physical Review Physics Education Research, 2018

Research on the test structure of the Force Concept Inventory (FCI) has largely ignored gender, and research on FCI gender effects (often reported as "gender gaps") has seldom interrogated the structure of the test. These rarely crossed streams of research leave open the possibility that the FCI may not be structurally valid across…

Descriptors: Physics, Science Instruction, Sex Fairness, Gender Differences

Teaching Introductory Measurement: Suggestions for What to Include and How to Motivate Students

Peer reviewed

Bandalos, Deborah L.; Kopp, Jason P. – Educational Measurement: Issues and Practice, 2012

In this article, we discuss the importance of measurement literacy and some issues encountered in teaching introductory measurement courses. We present results from a survey of introductory measurement instructors, including information about the topics included in such courses and the amount of time spent on each. Topics that were included by the…

Descriptors: Class Activities, Motivation Techniques, Item Analysis, Test Theory

Accessibility Theory for Enhancing the Validity of Test Results for Students with Special Needs

Peer reviewed

Beddow, Peter A. – International Journal of Disability, Development and Education, 2012

In the arena of educational testing, accessibility refers to the degree to which students are given the opportunity to participate in and engage a test. Accessibility theory is a model for examining the interactions between the test-taker and the test itself and defining how they may decrease some students' access to the test event, ultimately…

Descriptors: Test Results, Test Items, Educational Testing, Scores

On Validity Theory and Test Validation

Peer reviewed

Sireci, Stephen G. – Educational Researcher, 2007

Lissitz and Samuelsen (2007) propose a new framework for conceptualizing test validity that separates analysis of test properties from analysis of the construct measured. In response, the author of this article reviews fundamental characteristics of test validity, drawing largely from seminal writings as well as from the accepted standards. He…

Descriptors: Test Content, Test Validity, Guidelines, Test Items

Instrument Development Procedures for Mathematics Measures. Technical Report Number 08-02

Jung, Eunju; Liu, Kimy; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008

The purpose of this study was to develop general outcome measures (GOM) in mathematics so that teachers could focus their instruction on needed prerequisite skills. We describe in detail the manner in which content-related evidence was established and then present a number of statistical analyses conducted to evaluate the technical adequacy of…

Descriptors: Item Analysis, Test Construction, Test Theory, Mathematics Tests

Alternative Procedures for Collecting and Analyzing Job-Relatedness Judgments in NTE Test-Appraisal Studies.

Carlson, Robert E. – 1988

Early National Teacher Examinations test-appraisal studies focused only on the extent to which the content covered on a test matched the content of teacher education curricula. Recently, there has been a shift in emphasis from adequacy-of-preparation, which is now seen by many as irrelevant to establishing the appropriateness of a teacher…

Descriptors: Content Validity, Item Analysis, Job Performance, Licensing Examinations (Professions)

Comparison of Traditional and Latent Trait Procedures in Analysis and Selection of Rating Scale Items.

Gamache, LeAnn M. – 1983

Scales constructed under procedures and criteria outlined by the various traditional and latent trait methods were examined as to whether they varied in characteristics related to scale quality. Scales were constructed from a common pool of items analyzed in full form according to Likert and a one-parameter Rasch model for non-dichotomous data.…

Descriptors: Comparative Analysis, Correlation, Higher Education, Item Analysis

Latent Trait Approach to Domain Score Estimation.

Phillips, Gary W. – 1982

This paper presents an introduction to the use of latent trait models for the estimation of domain scores. It was shown that these models provided an advantage over classical test theory and binomial error models in that unbiased estimates of true domain scores could be obtained even when items were not randomly selected from a universe of items.…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Estimation (Mathematics), Goodness of Fit

Discrimination Indices Commonly Used in Military Training Environments: Effects of Departures from Normal Distributions.

Sarvela, Paul D. – 1986

Four discrimination indices were compared, using score distributions which were normal, bimodal, and negatively skewed. The score distributions were systematically varied to represent the common circumstances of a military training situation using criterion-referenced mastery tests. Three 20-item tests were administered to 110 simulated subjects.…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Mastery Tests

Armed Services Vocational Aptitude Battery: Development of Forms 11, 12, and 13.

Prestwood, J. Stephen; And Others – 1985

Six new forms of the Armed Services Vocational Aptitude Battery (ASVAB) were developed. These new forms were equated to a standard reference test, ASVAB 8a, using normative data based on a 1980 weighted probability sample of American youth, ages 18-23. Equating allows the services to report the distributions of examinee ability on a common metric or…

Descriptors: Ability Identification, Aptitude Tests, Armed Forces, Data Collection

Time-Score Analysis in Criterion-Referenced Tests. Final Report.

Tatsuoka, Kikumi K.; Tatsuoka, Maurice M. – 1978

The family of Weibull distributions was investigated as a model for the distributions of response times for items in computer-based criterion-referenced tests. The fit of these distributions was, with a few exceptions, good to excellent according to the Kolmogorov-Smirnov test. For a few relatively simple items, the two-parameter gamma…

Descriptors: Career Development, Computer Assisted Instruction, Computer Assisted Testing, Criterion Referenced Tests

Bibliography of Papers on Latent Trait Assessment.

Cohen, Allan S., Comp. – 1979

This partially annotated bibliography of journal articles, dissertations, convention papers, research reports, and a few books and unpublished manuscripts provides a comprehensive coverage of work on latent trait theory and practice. Documents are arranged alphabetically by author. The period covered ranges from the early 1950's to the present.…

Descriptors: Attitude Measures, Career Development, Computer Assisted Testing, Computer Programs
