ERIC - Search Results

Publication Date

In 2025	0
Since 2024	3
Since 2021 (last 5 years)	7
Since 2016 (last 10 years)	15
Since 2006 (last 20 years)	52

Descriptor

Test Validity	795
Test Reliability	274
Higher Education	224
Test Construction	142
Factor Structure	127
Factor Analysis	123
Correlation	105
Predictive Validity	89
Rating Scales	85
Academic Achievement	82
Attitude Measures	81
Personality Measures	73
College Students	71
Technical Reports	66
Achievement Tests	62
Psychometrics	59
Elementary Education	57
Item Analysis	57
Self Concept Measures	54
Intelligence Tests	53
Scores	53
Foreign Countries	51
Student Attitudes	50
Test Items	48
High Schools	45
More ▼

Source

Educational and Psychological…

795

Publication Type

Journal Articles	542
Reports - Research	488
Reports - Evaluative	49
Tests/Questionnaires	10
Reports - Descriptive	8
Speeches/Meeting Papers	8
Information Analyses	3
Opinion Papers	3
Numerical/Quantitative Data	1
Reports - General	1

Education Level

Higher Education	17
Postsecondary Education	10
Elementary Education	3
High Schools	3
Secondary Education	3
Middle Schools	2
Adult Basic Education	1
Adult Education	1
Early Childhood Education	1
Elementary Secondary Education	1
Grade 3	1
Grade 4	1
Junior High Schools	1
Preschool Education	1
More ▼

Audience

Practitioners

Location

Canada	11
Australia	8
Israel	7
Netherlands	3
Brazil	2
Germany	2
Iran	2
Mexico	2
New Zealand	2
Nigeria	2
Norway	2
United Kingdom (Great Britain)	2
Belgium	1
Canada (Toronto)	1
China	1
Finland	1
France	1
Greece	1
Guyana	1
Illinois	1
India	1
Indiana	1
Ireland	1
Japan	1
Jordan	1
More ▼

Laws, Policies, & Programs

Comprehensive Employment and…

What Works Clearinghouse Rating

Showing 1 to 15 of 795 results Save | Export

On the Relationship between Item Stem Formulation and Criterion Validity of Multiple-Component Measuring Instruments

Peer reviewed

Direct link

Menold, Natalja; Raykov, Tenko – Educational and Psychological Measurement, 2022

The possible dependency of criterion validity on item formulation in a multicomponent measuring instrument is examined. The discussion is concerned with evaluation of the differences in criterion validity between two or more groups (populations/subpopulations) that have been administered instruments with items having differently formulated item…

Descriptors: Test Items, Measures (Individuals), Test Validity, Difficulty Level

An Illustration of an IRTree Model for Disengagement

Peer reviewed

Direct link

Brian C. Leventhal; Dena Pastor – Educational and Psychological Measurement, 2024

Low-stakes test performance commonly reflects examinee ability and effort. Examinees exhibiting low effort may be identified through rapid guessing behavior throughout an assessment. There has been a plethora of methods proposed to adjust scores once rapid guesses have been identified, but these have been plagued by strong assumptions or the…

Descriptors: College Students, Guessing (Tests), Multiple Choice Tests, Item Response Theory

Separation of Traits and Extreme Response Style in IRTree Models: The Role of Mimicry Effects for the Meaningful Interpretation of Estimates

Peer reviewed

Direct link

Viola Merhof; Caroline M. Böhm; Thorsten Meiser – Educational and Psychological Measurement, 2024

Item response tree (IRTree) models are a flexible framework to control self-reported trait measurements for response styles. To this end, IRTree models decompose the responses to rating items into sub-decisions, which are assumed to be made on the basis of either the trait being measured or a response style, whereby the effects of such person…

Descriptors: Item Response Theory, Test Interpretation, Test Reliability, Test Validity

Measuring Unipolar Traits with Continuous Response Items: Some Methodological and Substantive Developments

Peer reviewed

Direct link

Pere J. Ferrando; Fabia Morales-Vives; Ana Hernández-Dorado – Educational and Psychological Measurement, 2024

In recent years, some models for binary and graded format responses have been proposed to assess unipolar variables or "quasi-traits." These studies have mainly focused on clinical variables that have traditionally been treated as bipolar traits. In the present study, we have made a proposal for unipolar traits measured with continuous…

Descriptors: Item Analysis, Goodness of Fit, Accuracy, Test Validity

Methods of Detecting Insufficient Effort Responding: Comparisons and Practical Recommendations

Peer reviewed

Direct link

Hong, Maxwell; Steedle, Jeffrey T.; Cheng, Ying – Educational and Psychological Measurement, 2020

Insufficient effort responding (IER) affects many forms of assessment in both educational and psychological contexts. Much research has examined different types of IER, IER's impact on the psychometric properties of test scores, and preprocessing procedures used to detect IER. However, there is a gap in the literature in terms of practical advice…

Descriptors: Responses, Psychometrics, Test Validity, Test Reliability

Thanks Coefficient Alpha, We Still Need You!

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2019

This note discusses the merits of coefficient alpha and their conditions in light of recent critical publications that miss out on significant research findings over the past several decades. That earlier research has demonstrated the empirical relevance and utility of coefficient alpha under certain empirical circumstances. The article highlights…

Descriptors: Test Validity, Test Reliability, Test Items, Correlation

Treatments of Differential Item Functioning: A Comparison of Four Methods

Peer reviewed

Direct link

Liu, Xiaowen; Jane Rogers, H. – Educational and Psychological Measurement, 2022

Test fairness is critical to the validity of group comparisons involving gender, ethnicities, culture, or treatment conditions. Detection of differential item functioning (DIF) is one component of efforts to ensure test fairness. The current study compared four treatments for items that have been identified as showing DIF: deleting, ignoring,…

Descriptors: Item Analysis, Comparative Analysis, Culture Fair Tests, Test Validity

Are Speeded Tests Unfair? Modeling the Impact of Time Limits on the Gender Gap in Mathematics

Peer reviewed

Direct link

Stoevenbelt, Andrea H.; Wicherts, Jelte M.; Flore, Paulette C.; Phillips, Lorraine A. T.; Pietschnig, Jakob; Verschuere, Bruno; Voracek, Martin; Schwabe, Inga – Educational and Psychological Measurement, 2023

When cognitive and educational tests are administered under time limits, tests may become speeded and this may affect the reliability and validity of the resulting test scores. Prior research has shown that time limits may create or enlarge gender gaps in cognitive and academic testing. On average, women complete fewer items than men when a test…

Descriptors: Timed Tests, Gender Differences, Item Response Theory, Correlation

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

Direct link

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

The Total Score with Maximal Reliability and Maximal Criterion Validity: An Illustration Using a Career Satisfaction Measure

Peer reviewed

Direct link

Fu, Yuanshu; Wen, Zhonglin; Wang, Yang – Educational and Psychological Measurement, 2018

The maximal reliability of a congeneric measure is achieved by weighting item scores to form the optimal linear combination as the total score; it is never lower than the composite reliability of the measure when measurement errors are uncorrelated. The statistical method that renders maximal reliability would also lead to maximal criterion…

Descriptors: Test Reliability, Test Validity, Comparative Analysis, Attitude Measures

Hypothesis Testing in the Real World

Peer reviewed

Direct link

Miller, Jeff – Educational and Psychological Measurement, 2017

Critics of null hypothesis significance testing suggest that (a) its basic logic is invalid and (b) it addresses a question that is of no interest. In contrast to (a), I argue that the underlying logic of hypothesis testing is actually extremely straightforward and compelling. To substantiate that, I present examples showing that hypothesis…

Descriptors: Hypothesis Testing, Testing Problems, Test Validity, Relevance (Education)

Survey Satisficing Inflates Reliability and Validity Measures: An Experimental Comparison of College and Amazon Mechanical Turk Samples

Peer reviewed

Direct link

Hamby, Tyler; Taylor, Wyn – Educational and Psychological Measurement, 2016

This study examined the predictors and psychometric outcomes of survey satisficing, wherein respondents provide quick, "good enough" answers (satisficing) rather than carefully considered answers (optimizing). We administered surveys to university students and respondents--half of whom held college degrees--from a for-pay survey website,…

Descriptors: Surveys, Test Reliability, Test Validity, Comparative Analysis

Reliability and Model Fit

Peer reviewed

Direct link

Stanley, Leanne M.; Edwards, Michael C. – Educational and Psychological Measurement, 2016

The purpose of this article is to highlight the distinction between the reliability of test scores and the fit of psychometric measurement models, reminding readers why it is important to consider both when evaluating whether test scores are valid for a proposed interpretation and/or use. It is often the case that an investigator judges both the…

Descriptors: Test Reliability, Goodness of Fit, Scores, Patients

Assessing Validity of Measurement in Learning Disabilities Using Hierarchical Generalized Linear Modeling: The Roles of Anxiety and Motivation

Peer reviewed

Direct link

Sideridis, Georgios D. – Educational and Psychological Measurement, 2016

The purpose of the present studies was to test the hypothesis that the psychometric characteristics of ability scales may be significantly distorted if one accounts for emotional factors during test taking. Specifically, the present studies evaluate the effects of anxiety and motivation on the item difficulties of the Rasch model. In Study 1, the…

Descriptors: Learning Disabilities, Test Validity, Measures (Individuals), Hierarchical Linear Modeling

Response Styles and the Rural-Urban Divide

Peer reviewed

Direct link

Thomas, Troy D.; Abts, Koen; Vander Weyden, Patrick – Educational and Psychological Measurement, 2014

This article investigates the effect of the rural-urban divide on mean response styles (RSs) and their relationships with the sociodemographic characteristics of the respondents. It uses the Representative Indicator Response Style Means and Covariance Structure (RIRSMACS) method and data from Guyana--a developing country in the Caribbean. The…

Descriptors: Rural Urban Differences, Response Style (Tests), Demography, Social Characteristics

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 53

Michael, William B.	34
Powers, Stephen	10
Martin, John D.	9
Knapp, Robert R.	8
Schriesheim, Chester A.	8
Hanna, Gerald S.	7
Plake, Barbara S.	6
Reynolds, Cecil R.	6
Hakstian, A. Ralph	5
Klein, Alice E.	5
Lewis, John	5
Michael, Joan J.	5
Omizo, Michael M.	5
Willoughby, T. Lee	5
Baldauf, Richard B., Jr.	4
Chissom, Brad S.	4
Knapp-Lee, Lisa	4
Krus, David J.	4
Mehrabian, Albert	4
Parish, Thomas S.	4
Reynolds, William M.	4
Shaffer, Phyllis	4
Thompson, Bruce	4
Abbott, Robert D.	3
More ▼

Wechsler Intelligence Scale…	15
SAT (College Admission Test)	14
Graduate Record Examinations	12
Piers Harris Childrens Self…	12
ACT Assessment	11
Dimensions of Self Concept	11
Slosson Intelligence Test	9
Coopersmith Self Esteem…	7
Metropolitan Readiness Tests	7
Personal Orientation Inventory	7
General Aptitude Test Battery	6
Maslach Burnout Inventory	6
Raven Progressive Matrices	6
Rotter Internal External…	6
California Psychological…	5
Career Maturity Inventory	5
Holland Vocational Preference…	5
Iowa Tests of Basic Skills	5
Minnesota Multiphasic…	5
Stanford Achievement Tests	5
Wechsler Adult Intelligence…	5
Adjective Check List	4
California Achievement Tests	4
Comprehensive Tests of Basic…	4
Differential Aptitude Test	4
More ▼