ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	6

Descriptor

Difficulty Level	11
Psychometrics	11
Test Items	8
Comparative Analysis	5
Item Response Theory	4
Test Validity	4
Item Analysis	3
College Students	2
Evaluation Methods	2
Foreign Countries	2
Goodness of Fit	2
Guessing (Tests)	2
Measurement Techniques	2
Motivation	2
Scoring	2
Test Interpretation	2
Ability	1
Age	1
Age Differences	1
Age Groups	1
Alumni	1
Anxiety	1
Behavior	1
Cognitive Processes	1
Correlation	1
More ▼

Source

Educational and Psychological…

Publication Type

Journal Articles	9
Reports - Research	4
Reports - Descriptive	2
Reports - Evaluative	2
Guides - Non-Classroom	1

Education Level

Higher Education	3
Postsecondary Education	2
Elementary Education	1
Secondary Education	1

Audience

Location

Greece

Laws, Policies, & Programs

Assessments and Surveys

Childrens Manifest Anxiety…	1
Raven Progressive Matrices	1
Rosenberg Self Esteem Scale	1

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Improvement of Norm Score Quality via Regression-Based Continuous Norming

Peer reviewed

Direct link

Lenhard, Wolfgang; Lenhard, Alexandra – Educational and Psychological Measurement, 2021

The interpretation of psychometric test results is usually based on norm scores. We compared semiparametric continuous norming (SPCN) with conventional norming methods by simulating results for test scales with different item numbers and difficulties via an item response theory approach. Subsequently, we modeled the norm scores based on random…

Descriptors: Test Norms, Scores, Regression (Statistics), Test Items

Survey Satisficing Inflates Reliability and Validity Measures: An Experimental Comparison of College and Amazon Mechanical Turk Samples

Peer reviewed

Direct link

Hamby, Tyler; Taylor, Wyn – Educational and Psychological Measurement, 2016

This study examined the predictors and psychometric outcomes of survey satisficing, wherein respondents provide quick, "good enough" answers (satisficing) rather than carefully considered answers (optimizing). We administered surveys to university students and respondents--half of whom held college degrees--from a for-pay survey website,…

Descriptors: Surveys, Test Reliability, Test Validity, Comparative Analysis

An IRT Examination of the Psychometric Functioning of Negatively Worded Personality Items

Peer reviewed

Direct link

Sliter, Katherine A.; Zickar, Michael J. – Educational and Psychological Measurement, 2014

This study compared the functioning of positively and negatively worded personality items using item response theory. In Study 1, word pairs from the Goldberg Adjective Checklist were analyzed using the Graded Response Model. Across subscales, negatively worded items produced comparatively higher difficulty and lower discrimination parameters than…

Descriptors: Item Response Theory, Psychometrics, Personality Measures, Test Items

Assessing Validity of Measurement in Learning Disabilities Using Hierarchical Generalized Linear Modeling: The Roles of Anxiety and Motivation

Peer reviewed

Direct link

Sideridis, Georgios D. – Educational and Psychological Measurement, 2016

The purpose of the present studies was to test the hypothesis that the psychometric characteristics of ability scales may be significantly distorted if one accounts for emotional factors during test taking. Specifically, the present studies evaluate the effects of anxiety and motivation on the item difficulties of the Rasch model. In Study 1, the…

Descriptors: Learning Disabilities, Test Validity, Measures (Individuals), Hierarchical Linear Modeling

Applications of the Linear Logistic Test Model in Psychometric Research

Peer reviewed

Direct link

Kubinger, Klaus D. – Educational and Psychological Measurement, 2009

The linear logistic test model (LLTM) breaks down the item parameter of the Rasch model as a linear combination of some hypothesized elementary parameters. Although the original purpose of applying the LLTM was primarily to generate test items with specified item difficulty, there are still many other potential applications, which may be of use…

Descriptors: Models, Test Items, Psychometrics, Item Response Theory

The Use of Control Charts to Display Item Analysis Data.

Peer reviewed

Rogers, Paul W. – Educational and Psychological Measurement, 1978

Two procedures for the display of item analysis statistics are described. One procedure allows for investigation of difficulty; the second plots item difficulty against item discrimination. (Author/JKS)

Descriptors: Difficulty Level, Graphs, Guidelines, Item Analysis

Measuring the Influence of Complexity on Relational Reasoning

Peer reviewed

Direct link

Birney, Damian P.; Halford, Graeme S.; Andrews, Glenda – Educational and Psychological Measurement, 2006

Relational complexity (RC) theory conceptualizes an individual's processing capacity and a task's complexity along a common ordinal metric. The authors describe the development of the Latin Square Task (LST) that assesses the influence of RC on reasoning. The LST minimizes the role of knowledge and storage capacity and thus refines the…

Descriptors: Memory, Age Differences, Cognitive Processes, Psychometrics

Logical Versus Empirical Estimates of Item Difficulty

Peer reviewed

Quereshi, M. Y.; Fisher, Thomas L. – Educational and Psychological Measurement, 1977

Logical estimates of item difficulty made by judges were compared to empirical estimates derived from a test administration. Results indicated substantial correspondence between logical and empirical estimates, and substantial variation among judges. Further, the more elaborate the system used by judges to make estimates, the more accurate the…

Descriptors: Court Judges, Difficulty Level, Evaluation Methods, Item Analysis

Rasch Rating Scale Modeling of the Korean Version of the Beck Depression Inventory

Peer reviewed

Direct link

Hong, Sehee; Wong, Eunice C. – Educational and Psychological Measurement, 2005

The Beck Depression Inventory (BDI) is one of the most frequently used instruments in the study of depression both within and outside of the United States. Though developed primarily with European American clinical populations, the BDI has been applied in nonclinical and non-Western samples. To determine whether such a practice is warranted, the…

Descriptors: Difficulty Level, Rating Scales, Depression (Psychology), Evaluation Methods

A Theoretical Study of the Measurement Effectiveness of Flexilevel Tests

Peer reviewed

Lord, Frederic M. – Educational and Psychological Measurement, 1971

A number of empirical studies are suggested to answer certain questions in connection with flexilevel tests. (MS)

Descriptors: Comparative Analysis, Difficulty Level, Guessing (Tests), Item Analysis

A Brief Report on a Comparison of Six Scoring Methods for Multiple True-False Items.

Peer reviewed

Tsai, Fu-Ju; Suen, Hoi K. – Educational and Psychological Measurement, 1993

Six methods of scoring multiple true-false items were compared in terms of reliabilities, difficulties, and discrimination. Results suggest that, for norm-referenced score interpretations, there is insufficient evidence to support any one of the methods as superior. For criterion-referenced score interpretations, effects of scoring method must be…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Difficulty Level, Guessing (Tests)

Andrews, Glenda	1
Birney, Damian P.	1
Fisher, Thomas L.	1
Halford, Graeme S.	1
Hamby, Tyler	1
Hong, Sehee	1
Kubinger, Klaus D.	1
Lenhard, Alexandra	1
Lenhard, Wolfgang	1
Lord, Frederic M.	1
Quereshi, M. Y.	1
Rogers, Paul W.	1
Sideridis, Georgios D.	1
Sliter, Katherine A.	1
Suen, Hoi K.	1
Taylor, Wyn	1
Tsai, Fu-Ju	1
Wong, Eunice C.	1
Zickar, Michael J.	1
More ▼