Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 4 |
Descriptor
Difficulty Level | 10 |
Test Validity | 10 |
Test Items | 6 |
Psychometrics | 4 |
Foreign Countries | 3 |
Item Analysis | 3 |
Measures (Individuals) | 3 |
Test Reliability | 3 |
Evaluation Methods | 2 |
Higher Education | 2 |
Motivation | 2 |
More ▼ |
Source
Educational and Psychological… | 10 |
Author
Cizek, Gregory J. | 1 |
DiPerna, James C. | 1 |
Fisher, Thomas L. | 1 |
Hamby, Tyler | 1 |
Hong, Sehee | 1 |
Jacobs, Stanley S. | 1 |
Lei, Pui-Wa | 1 |
Menold, Natalja | 1 |
Morgan, Paul L. | 1 |
O'Day, Denis M. | 1 |
Quereshi, M. Y. | 1 |
More ▼ |
Publication Type
Journal Articles | 8 |
Reports - Research | 6 |
Reports - Evaluative | 2 |
Education Level
Higher Education | 2 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Postsecondary Education | 1 |
Preschool Education | 1 |
Secondary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
Childrens Manifest Anxiety… | 1 |
Raven Progressive Matrices | 1 |
Rosenberg Self Esteem Scale | 1 |
What Works Clearinghouse Rating
Menold, Natalja; Raykov, Tenko – Educational and Psychological Measurement, 2022
The possible dependency of criterion validity on item formulation in a multicomponent measuring instrument is examined. The discussion is concerned with evaluation of the differences in criterion validity between two or more groups (populations/subpopulations) that have been administered instruments with items having differently formulated item…
Descriptors: Test Items, Measures (Individuals), Test Validity, Difficulty Level
Hamby, Tyler; Taylor, Wyn – Educational and Psychological Measurement, 2016
This study examined the predictors and psychometric outcomes of survey satisficing, wherein respondents provide quick, "good enough" answers (satisficing) rather than carefully considered answers (optimizing). We administered surveys to university students and respondents--half of whom held college degrees--from a for-pay survey website,…
Descriptors: Surveys, Test Reliability, Test Validity, Comparative Analysis
Sideridis, Georgios D. – Educational and Psychological Measurement, 2016
The purpose of the present studies was to test the hypothesis that the psychometric characteristics of ability scales may be significantly distorted if one accounts for emotional factors during test taking. Specifically, the present studies evaluate the effects of anxiety and motivation on the item difficulties of the Rasch model. In Study 1, the…
Descriptors: Learning Disabilities, Test Validity, Measures (Individuals), Hierarchical Linear Modeling
Lei, Pui-Wa; Wu, Qiong; DiPerna, James C.; Morgan, Paul L. – Educational and Psychological Measurement, 2009
Currently, few measures are available to monitor young children's progress in acquiring key early academic skills. In response to this need, the authors have begun developing measures (i.e., the Early Arithmetic, Reading and Learning Indicators, or EARLI) of preschoolers' numeracy skills. To accurately and efficiently monitor acquisition of early…
Descriptors: Preschool Children, Measures (Individuals), Numeracy, Emergent Literacy

Jacobs, Stanley S. – Educational and Psychological Measurement, 1972
Data quite clearly indicated that students should be allowed and encouraged to reconsider and evaluate their responses to objective test items. (Author)
Descriptors: Difficulty Level, Objective Tests, Response Style (Tests), Tables (Data)

Quereshi, M. Y.; Fisher, Thomas L. – Educational and Psychological Measurement, 1977
Logical estimates of item difficulty made by judges were compared to empirical estimates derived from a test administration. Results indicated substantial correspondence between logical and empirical estimates, and substantial variation among judges. Further, the more elaborate the system used by judges to make estimates, the more accurate the…
Descriptors: Court Judges, Difficulty Level, Evaluation Methods, Item Analysis

Cizek, Gregory J.; Robinson, K. Lynne; O'Day, Denis M. – Educational and Psychological Measurement, 1998
The effect of removing nonfunctioning items from multiple-choice tests was studied by examining change in difficulty, discrimination, and dimensionality. Results provide additional support for the benefits of eliminating nonfunctioning options, such as enhanced score reliability, reduced testing time, potential for broader domain sampling, and…
Descriptors: Difficulty Level, Multiple Choice Tests, Sampling, Scores

Rindler, Susan Ellerin – Educational and Psychological Measurement, 1980
A short verbal aptitude test was administered under varying time limits with answer sheets specially designed to allow items that had been skipped to be identified. It appeared advantageous for the more able (based on grade point averages) but disadvantageous for the less able to skip items. (Author/RL)
Descriptors: Aptitude Tests, Difficulty Level, Higher Education, Response Style (Tests)

Willoughby, T. Lee – Educational and Psychological Measurement, 1980
The reliability and validity of a priori estimates of item characteristics are assessed. Results suggest that judges can make a modest contribution to estimation prior to actual administration. (Author/GK)
Descriptors: Difficulty Level, Higher Education, Item Analysis, Medical School Faculty
Hong, Sehee; Wong, Eunice C. – Educational and Psychological Measurement, 2005
The Beck Depression Inventory (BDI) is one of the most frequently used instruments in the study of depression both within and outside of the United States. Though developed primarily with European American clinical populations, the BDI has been applied in nonclinical and non-Western samples. To determine whether such a practice is warranted, the…
Descriptors: Difficulty Level, Rating Scales, Depression (Psychology), Evaluation Methods