ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	13

Descriptor

Statistical Analysis	37
Test Items	37
Test Theory	37
Test Construction	14
Item Analysis	13
Mathematical Models	9
Multiple Choice Tests	9
Psychometrics	9
Difficulty Level	8
Item Response Theory	8
Test Validity	8
Comparative Analysis	7
Achievement Tests	6
Scores	6
Criterion Referenced Tests	5
Measurement Techniques	5
Scoring	5
Simulation	5
Test Reliability	5
Testing Problems	5
Computation	4
Correlation	4
Error of Measurement	4
Latent Trait Theory	4
Reading Tests	4
More ▼

Source

Educational and Psychological…	4
ETS Research Report Series	3
Behavioral Research and…	1
Chemistry Education Research…	1
Current Issues in Education	1
International Journal of…	1
Journal of Educational and…	1
Journal of Interactive Online…	1
Journal of Pedagogical…	1
Marketing Education Review	1
Physical Review Physics…	1
ProQuest LLC	1
System	1
Teaching of Psychology	1
More ▼

Publication Type

Reports - Research	28
Journal Articles	17
Speeches/Meeting Papers	9
Reports - Descriptive	3
Reports - Evaluative	3
ERIC Digests in Full Text	2
ERIC Publications	2
Collected Works - Proceedings	1
Dissertations/Theses -…	1
Guides - Non-Classroom	1
Numerical/Quantitative Data	1
More ▼

Education Level

Higher Education	6
Postsecondary Education	5
Elementary Education	3
Grade 8	3
Middle Schools	3
Grade 3	2
Grade 4	2
Junior High Schools	2
Secondary Education	2
Grade 5	1
Grade 6	1
Grade 7	1
High Schools	1
Intermediate Grades	1
More ▼

Audience

Researchers

Location

Germany	1
Texas	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

Comprehensive Tests of Basic…	2
California Achievement Tests	1
Defining Issues Test	1
Law School Admission Test	1
National Assessment of…	1
Piers Harris Childrens Self…	1
Tennessee Self Concept Scale	1

What Works Clearinghouse Rating

Showing 1 to 15 of 37 results Save | Export

On True Score Evaluation Using Item Response Theory Modeling

Peer reviewed

Direct link

Raykov, Tenko; Dimitrov, Dimiter M.; Marcoulides, George A.; Harrison, Michael – Educational and Psychological Measurement, 2019

Building on prior research on the relationships between key concepts in item response theory and classical test theory, this note contributes to highlighting their important and useful links. A readily and widely applicable latent variable modeling procedure is discussed that can be used for point and interval estimation of the individual person…

Descriptors: True Scores, Item Response Theory, Test Items, Test Theory

Effects of Various Simulation Conditions on Latent-Trait Estimates: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Kogar, Hakan – International Journal of Assessment Tools in Education, 2018

The aim of this simulation study, determine the relationship between true latent scores and estimated latent scores by including various control variables and different statistical models. The study also aimed to compare the statistical models and determine the effects of different distribution types, response formats and sample sizes on latent…

Descriptors: Simulation, Context Effect, Computation, Statistical Analysis

Determination of Differential Item Functioning (DIF) According to SIBTEST, Lord's [Chi-squared], Raju's Area Measurement and Breslow-Day Methods

Peer reviewed
PDF on ERIC

Download full text

Ayva Yörü, Fatma Gökçen; Atar, Hakan Yavuz – Journal of Pedagogical Research, 2019

The aim of this study is to examine whether the items in the mathematics subtest of the Centralized High School Entrance Placement Test [HSEPT] administered in 2012 by the Ministry of National Education in Turkey show DIF according to gender and type of school. For this purpose, SIBTEST, Breslow-Day, Lord's [chi-squared] and Raju's area…

Descriptors: Test Bias, Mathematics Tests, Test Items, Gender Differences

Gender Fairness within the Force Concept Inventory

Peer reviewed

Direct link

Traxler, Adrienne; Henderson, Rachel; Stewart, John; Stewart, Gay; Papak, Alexis; Lindell, Rebecca – Physical Review Physics Education Research, 2018

Research on the test structure of the Force Concept Inventory (FCI) has largely ignored gender, and research on FCI gender effects (often reported as "gender gaps") has seldom interrogated the structure of the test. These rarely crossed streams of research leave open the possibility that the FCI may not be structurally valid across…

Descriptors: Physics, Science Instruction, Sex Fairness, Gender Differences

An Information-Correction Method for Testlet-Based Test Analysis: From the Perspectives of Item Response Theory and Generalizability Theory. Research Report. ETS RR-17-27

Peer reviewed
PDF on ERIC

Download full text

Li, Feifei – ETS Research Report Series, 2017

An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…

Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement

An Inventory for Measuring Student Teachers' Knowledge of Chemical Representations: Design, Validation, and Psychometric Analysis

Peer reviewed

Direct link

Taskin, V.; Bernholt, S.; Parchmann, I. – Chemistry Education Research and Practice, 2015

Chemical representations play an important role in helping learners to understand chemical contents. Thus, dealing with chemical representations is a necessity for learning chemistry, but at the same time, it presents a great challenge to learners. Due to this great challenge, it is not surprising that numerous national and international studies…

Descriptors: Student Teachers, Knowledge Level, Science Instruction, Chemistry

Criterion-Referenced Exit Examinations: An Institution's Internal Process for Psychometric Analysis

Peer reviewed

Direct link

Lieneck, Cristian; Morrison, Eileen; Price, Larry – Current Issues in Education, 2013

The Texas State University-San Marcos undergraduate healthcare administration program requires all bachelors of health administration (BHA) students to pass a comprehensive examination to demonstrate their knowledge of specific core competencies. This also demonstrates completion of their didactic coursework in order to enter a practical…

Descriptors: Exit Examinations, Health Services, Administrator Education, Psychometrics

Evaluating IRT- and CTT-Based Methods of Estimating Classification Consistency and Accuracy Indices from Single Administrations

Direct link

Deng, Nina – ProQuest LLC, 2011

Three decision consistency and accuracy (DC/DA) methods, the Livingston and Lewis (LL) method, LEE method, and the Hambleton and Han (HH) method, were evaluated. The purposes of the study were: (1) to evaluate the accuracy and robustness of these methods, especially when their assumptions were not well satisfied, (2) to investigate the "true"…

Descriptors: Item Response Theory, Test Theory, Computation, Classification

An Innovative Excel Application to Improve Exam Reliability in Marketing Courses

Peer reviewed

Direct link

Keller, Christopher M.; Kros, John F. – Marketing Education Review, 2011

Measures of survey reliability are commonly addressed in marketing courses. One statistic of reliability is "Cronbach's alpha." This paper presents an application of survey reliability as a reflexive application of multiple-choice exam validation. The application provides an interactive decision support system that incorporates survey item…

Descriptors: Test Validity, Marketing, Test Reliability, Multiple Choice Tests

An Equipercentile Version of the Levine Linear Observed-Score Equating Function Using the Methods of Kernel Equating. Research Report. ETS RR-07-14

Peer reviewed
PDF on ERIC

Download full text

von Davier, Alina A.; Fournier-Zajac, Stephanie; Holland, Paul W. – ETS Research Report Series, 2007

In the nonequivalent groups with anchor test (NEAT) design, there are several ways to use the information provided by the anchor in the equating process. One of the NEAT-design equating methods is the linear observed-score Levine method (Kolen & Brennan, 2004). It is based on a classical test theory model of the true scores on the test forms…

Descriptors: Equated Scores, Statistical Analysis, Test Items, Test Theory

Instrument Development Procedures for Mathematics Measures. Technical Report Number 08-02

Download full text

Jung, Eunju; Liu, Kimy; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008

The purpose of this study was to develop general outcome measures (GOM) in mathematics so that teachers could focus their instruction on needed prerequisite skills. We describe in detail, the manner in which content-related evidence was established and then present a number of statistical analyses conducted to evaluate the technical adequacy of…

Descriptors: Item Analysis, Test Construction, Test Theory, Mathematics Tests

Detecting Answer Copying when the Regular Response Process Follows a Known Response Model

Peer reviewed

Direct link

van der Linden, Wim J.; Sotaridona, Leonardo – Journal of Educational and Behavioral Statistics, 2006

A statistical test for detecting answer copying on multiple-choice items is presented. The test is based on the exact null distribution of the number of random matches between two test takers under the assumption that the response process follows a known response model. The null distribution can easily be generalized to the family of distributions…

Descriptors: Test Items, Multiple Choice Tests, Cheating, Responses

Item Response Theory and Classical Test Theory: An Empirical Comparison of Their Item/Person Statistics.

Peer reviewed

Fan, Xitao – Educational and Psychological Measurement, 1998

This study empirically examined the behaviors of item and person statistics derived from item response theory and classical test theory, focusing on item and person statistics and using a large-scale statewide assessment. Findings show that the person and item statistics from the two measurement frameworks are quite comparable. (SLD)

Descriptors: Item Response Theory, State Programs, Statistical Analysis, Test Items

ERGO: A New Approach to Multidimensional Item Analysis.

Peer reviewed

Reynolds, Thomas J. – Educational and Psychological Measurement, 1981

Cliff's Index "c" derived from an item dominance matrix is utilized in a clustering approach, termed extracting Reliable Guttman Orders (ERGO), to isolate Guttman-type item hierarchies. A comparison of factor analysis to the ERGO is made on social distance data involving multiple ethnic groups. (Author/BW)

Descriptors: Cluster Analysis, Difficulty Level, Factor Analysis, Item Analysis

The Language Tester's Statistical Toolbox.

Peer reviewed

Davidson, Fred – System, 2000

Statistical analysis tools in language testing are described, chiefly classical test theory and item response theory. Computer software for statistical analysis is briefly reviewed and divided into three tiers: commonly available; statistical packages; and specialty software. (Author/VWL)

Descriptors: Computer Software, Language Tests, Second Language Learning, Statistical Analysis

Previous Page | Next Page »

Pages: 1 | 2 | 3

Yen, Wendy M.	2
Atar, Hakan Yavuz	1
Ayva Yörü, Fatma Gökçen	1
Balch, William R.	1
Bernholt, S.	1
Broussard, Rolland L.	1
Bruno, James E.	1
Buchanan, Aaron	1
Davidson, Fred	1
Deng, Nina	1
Dimitrov, Dimiter M.	1
Dirkzwager, A.	1
Fan, Xitao	1
Fournier-Zajac, Stephanie	1
Frary, Robert B.	1
Haladyna, Tom	1
Harrison, Michael	1
Henderson, Rachel	1
Holland, Paul W.	1
Hutchinson, T. P.	1
Iran-Nejad, Asghar	1
Jung, Eunju	1
Kehoe, Jerard	1
Keller, Christopher M.	1
More ▼