Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 6 |
Descriptor
Difficulty Level | 11 |
Psychometrics | 11 |
Test Items | 8 |
Comparative Analysis | 5 |
Item Response Theory | 4 |
Test Validity | 4 |
Item Analysis | 3 |
College Students | 2 |
Evaluation Methods | 2 |
Foreign Countries | 2 |
Goodness of Fit | 2 |
More ▼ |
Source
Educational and Psychological… | 11 |
Author
Andrews, Glenda | 1 |
Birney, Damian P. | 1 |
Fisher, Thomas L. | 1 |
Halford, Graeme S. | 1 |
Hamby, Tyler | 1 |
Hong, Sehee | 1 |
Kubinger, Klaus D. | 1 |
Lenhard, Alexandra | 1 |
Lenhard, Wolfgang | 1 |
Lord, Frederic M. | 1 |
Quereshi, M. Y. | 1 |
More ▼ |
Publication Type
Journal Articles | 9 |
Reports - Research | 4 |
Reports - Descriptive | 2 |
Reports - Evaluative | 2 |
Guides - Non-Classroom | 1 |
Audience
Location
Greece | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Childrens Manifest Anxiety… | 1 |
Raven Progressive Matrices | 1 |
Rosenberg Self Esteem Scale | 1 |
What Works Clearinghouse Rating
Lenhard, Wolfgang; Lenhard, Alexandra – Educational and Psychological Measurement, 2021
The interpretation of psychometric test results is usually based on norm scores. We compared semiparametric continuous norming (SPCN) with conventional norming methods by simulating results for test scales with different item numbers and difficulties via an item response theory approach. Subsequently, we modeled the norm scores based on random…
Descriptors: Test Norms, Scores, Regression (Statistics), Test Items
Hamby, Tyler; Taylor, Wyn – Educational and Psychological Measurement, 2016
This study examined the predictors and psychometric outcomes of survey satisficing, wherein respondents provide quick, "good enough" answers (satisficing) rather than carefully considered answers (optimizing). We administered surveys to university students and respondents--half of whom held college degrees--from a for-pay survey website,…
Descriptors: Surveys, Test Reliability, Test Validity, Comparative Analysis
Sliter, Katherine A.; Zickar, Michael J. – Educational and Psychological Measurement, 2014
This study compared the functioning of positively and negatively worded personality items using item response theory. In Study 1, word pairs from the Goldberg Adjective Checklist were analyzed using the Graded Response Model. Across subscales, negatively worded items produced comparatively higher difficulty and lower discrimination parameters than…
Descriptors: Item Response Theory, Psychometrics, Personality Measures, Test Items
Sideridis, Georgios D. – Educational and Psychological Measurement, 2016
The purpose of the present studies was to test the hypothesis that the psychometric characteristics of ability scales may be significantly distorted if one accounts for emotional factors during test taking. Specifically, the present studies evaluate the effects of anxiety and motivation on the item difficulties of the Rasch model. In Study 1, the…
Descriptors: Learning Disabilities, Test Validity, Measures (Individuals), Hierarchical Linear Modeling
Kubinger, Klaus D. – Educational and Psychological Measurement, 2009
The linear logistic test model (LLTM) breaks down the item parameter of the Rasch model as a linear combination of some hypothesized elementary parameters. Although the original purpose of applying the LLTM was primarily to generate test items with specified item difficulty, there are still many other potential applications, which may be of use…
Descriptors: Models, Test Items, Psychometrics, Item Response Theory

Rogers, Paul W. – Educational and Psychological Measurement, 1978
Two procedures for the display of item analysis statistics are described. One procedure allows for investigation of difficulty; the second plots item difficulty against item discrimination. (Author/JKS)
Descriptors: Difficulty Level, Graphs, Guidelines, Item Analysis
Birney, Damian P.; Halford, Graeme S.; Andrews, Glenda – Educational and Psychological Measurement, 2006
Relational complexity (RC) theory conceptualizes an individual's processing capacity and a task's complexity along a common ordinal metric. The authors describe the development of the Latin Square Task (LST) that assesses the influence of RC on reasoning. The LST minimizes the role of knowledge and storage capacity and thus refines the…
Descriptors: Memory, Age Differences, Cognitive Processes, Psychometrics

Quereshi, M. Y.; Fisher, Thomas L. – Educational and Psychological Measurement, 1977
Logical estimates of item difficulty made by judges were compared to empirical estimates derived from a test administration. Results indicated substantial correspondence between logical and empirical estimates, and substantial variation among judges. Further, the more elaborate the system used by judges to make estimates, the more accurate the…
Descriptors: Court Judges, Difficulty Level, Evaluation Methods, Item Analysis
Hong, Sehee; Wong, Eunice C. – Educational and Psychological Measurement, 2005
The Beck Depression Inventory (BDI) is one of the most frequently used instruments in the study of depression both within and outside of the United States. Though developed primarily with European American clinical populations, the BDI has been applied in nonclinical and non-Western samples. To determine whether such a practice is warranted, the…
Descriptors: Difficulty Level, Rating Scales, Depression (Psychology), Evaluation Methods

Lord, Frederic M. – Educational and Psychological Measurement, 1971
A number of empirical studies are suggested to answer certain questions in connection with flexilevel tests. (MS)
Descriptors: Comparative Analysis, Difficulty Level, Guessing (Tests), Item Analysis

Tsai, Fu-Ju; Suen, Hoi K. – Educational and Psychological Measurement, 1993
Six methods of scoring multiple true-false items were compared in terms of reliabilities, difficulties, and discrimination. Results suggest that, for norm-referenced score interpretations, there is insufficient evidence to support any one of the methods as superior. For criterion-referenced score interpretations, effects of scoring method must be…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Difficulty Level, Guessing (Tests)