ERIC - Search Results

Showing all 10 results

On the Relationship between Item Stem Formulation and Criterion Validity of Multiple-Component Measuring Instruments

Peer reviewed

Menold, Natalja; Raykov, Tenko – Educational and Psychological Measurement, 2022

The possible dependency of criterion validity on item formulation in a multicomponent measuring instrument is examined. The discussion is concerned with evaluation of the differences in criterion validity between two or more groups (populations/subpopulations) that have been administered instruments with items having differently formulated item…

Descriptors: Test Items, Measures (Individuals), Test Validity, Difficulty Level

Survey Satisficing Inflates Reliability and Validity Measures: An Experimental Comparison of College and Amazon Mechanical Turk Samples

Peer reviewed

Hamby, Tyler; Taylor, Wyn – Educational and Psychological Measurement, 2016

This study examined the predictors and psychometric outcomes of survey satisficing, wherein respondents provide quick, "good enough" answers (satisficing) rather than carefully considered answers (optimizing). We administered surveys to university students and respondents--half of whom held college degrees--from a for-pay survey website,…

Descriptors: Surveys, Test Reliability, Test Validity, Comparative Analysis

Assessing Validity of Measurement in Learning Disabilities Using Hierarchical Generalized Linear Modeling: The Roles of Anxiety and Motivation

Peer reviewed

Sideridis, Georgios D. – Educational and Psychological Measurement, 2016

The purpose of the present studies was to test the hypothesis that the psychometric characteristics of ability scales may be significantly distorted if one accounts for emotional factors during test taking. Specifically, the present studies evaluate the effects of anxiety and motivation on the item difficulties of the Rasch model. In Study 1, the…

Descriptors: Learning Disabilities, Test Validity, Measures (Individuals), Hierarchical Linear Modeling

Developing Short Forms of the EARLI Numeracy Measures: Comparison of Item Selection Methods

Peer reviewed

Lei, Pui-Wa; Wu, Qiong; DiPerna, James C.; Morgan, Paul L. – Educational and Psychological Measurement, 2009

Currently, few measures are available to monitor young children's progress in acquiring key early academic skills. In response to this need, the authors have begun developing measures (i.e., the Early Arithmetic, Reading and Learning Indicators, or EARLI) of preschoolers' numeracy skills. To accurately and efficiently monitor acquisition of early…

Descriptors: Preschool Children, Measures (Individuals), Numeracy, Emergent Literacy

Answer Changing on Objective Tests: Some Implications for Test Validity

Peer reviewed

Jacobs, Stanley S. – Educational and Psychological Measurement, 1972

Data quite clearly indicated that students should be allowed and encouraged to reconsider and evaluate their responses to objective test items. (Author)

Descriptors: Difficulty Level, Objective Tests, Response Style (Tests), Tables (Data)

Logical Versus Empirical Estimates of Item Difficulty

Peer reviewed

Quereshi, M. Y.; Fisher, Thomas L. – Educational and Psychological Measurement, 1977

Logical estimates of item difficulty made by judges were compared to empirical estimates derived from a test administration. Results indicated substantial correspondence between logical and empirical estimates, and substantial variation among judges. Further, the more elaborate the system used by judges to make estimates, the more accurate the…

Descriptors: Court Judges, Difficulty Level, Evaluation Methods, Item Analysis

Nonfunctioning Options: A Closer Look

Peer reviewed

Cizek, Gregory J.; Robinson, K. Lynne; O'Day, Denis M. – Educational and Psychological Measurement, 1998

The effect of removing nonfunctioning options from multiple-choice test items was studied by examining change in difficulty, discrimination, and dimensionality. Results provide additional support for the benefits of eliminating nonfunctioning options, such as enhanced score reliability, reduced testing time, potential for broader domain sampling, and…

Descriptors: Difficulty Level, Multiple Choice Tests, Sampling, Scores

The Effects of Skipping Over More Difficult Items on Time-Limited Tests: Implications for Test Validity

Peer reviewed

Rindler, Susan Ellerin – Educational and Psychological Measurement, 1980

A short verbal aptitude test was administered under varying time limits with answer sheets specially designed to allow items that had been skipped to be identified. It appeared advantageous for the more able (based on grade point averages) but disadvantageous for the less able to skip items. (Author/RL)

Descriptors: Aptitude Tests, Difficulty Level, Higher Education, Response Style (Tests)

Reliability and Validity of a Priori Estimates of Item Characteristics for an Examination of Health Science Information

Peer reviewed

Willoughby, T. Lee – Educational and Psychological Measurement, 1980

The reliability and validity of a priori estimates of item characteristics are assessed. Results suggest that judges can make a modest contribution to estimation prior to actual administration. (Author/GK)

Descriptors: Difficulty Level, Higher Education, Item Analysis, Medical School Faculty

Rasch Rating Scale Modeling of the Korean Version of the Beck Depression Inventory

Peer reviewed

Hong, Sehee; Wong, Eunice C. – Educational and Psychological Measurement, 2005

The Beck Depression Inventory (BDI) is one of the most frequently used instruments in the study of depression both within and outside of the United States. Though developed primarily with European American clinical populations, the BDI has been applied in nonclinical and non-Western samples. To determine whether such a practice is warranted, the…

Descriptors: Difficulty Level, Rating Scales, Depression (Psychology), Evaluation Methods
