ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	9
Since 2006 (last 20 years)	22

Descriptor

Test Reliability	274
Test Validity	274
Higher Education	79
Test Construction	78
Factor Structure	50
Factor Analysis	39
Attitude Measures	36
Rating Scales	36
Psychometrics	31
Correlation	28
College Students	25
Self Concept Measures	25
Elementary Secondary Education	22
Item Analysis	21
Questionnaires	20
Technical Reports	20
Test Items	20
Elementary Education	19
Personality Measures	19
Scores	19
Student Attitudes	19
Teacher Attitudes	18
Foreign Countries	16
Academic Achievement	15
Achievement Tests	15

Source

Educational and Psychological…	274

Publication Type

Journal Articles	190
Reports - Research	168
Reports - Evaluative	17
Reports - Descriptive	6
Speeches/Meeting Papers	3
Tests/Questionnaires	3
Numerical/Quantitative Data	1
Opinion Papers	1

Education Level

Higher Education	7
Postsecondary Education	4
Elementary Education	2
High Schools	2
Middle Schools	2
Secondary Education	2
Grade 3	1
Grade 4	1
Junior High Schools	1

Location

Canada	5
Australia	4
Netherlands	2
Norway	2
Belgium	1
China	1
Finland	1
France	1
Germany	1
India	1
Iran	1
Jordan	1
Malaysia	1
Mexico	1
Michigan	1
New Zealand	1
Nigeria	1
Oregon	1
Pennsylvania	1
Philippines	1
South Africa	1
Spain	1
Washington	1

Showing 1 to 15 of 274 results

Separation of Traits and Extreme Response Style in IRTree Models: The Role of Mimicry Effects for the Meaningful Interpretation of Estimates

Peer reviewed

Merhof, Viola; Böhm, Caroline M.; Meiser, Thorsten – Educational and Psychological Measurement, 2024

Item response tree (IRTree) models are a flexible framework to control self-reported trait measurements for response styles. To this end, IRTree models decompose the responses to rating items into sub-decisions, which are assumed to be made on the basis of either the trait being measured or a response style, whereby the effects of such person…

Descriptors: Item Response Theory, Test Interpretation, Test Reliability, Test Validity

Methods of Detecting Insufficient Effort Responding: Comparisons and Practical Recommendations

Peer reviewed

Hong, Maxwell; Steedle, Jeffrey T.; Cheng, Ying – Educational and Psychological Measurement, 2020

Insufficient effort responding (IER) affects many forms of assessment in both educational and psychological contexts. Much research has examined different types of IER, IER's impact on the psychometric properties of test scores, and preprocessing procedures used to detect IER. However, there is a gap in the literature in terms of practical advice…

Descriptors: Responses, Psychometrics, Test Validity, Test Reliability

Thanks Coefficient Alpha, We Still Need You!

Peer reviewed

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2019

This note discusses the merits of coefficient alpha and their conditions in light of recent critical publications that miss out on significant research findings over the past several decades. That earlier research has demonstrated the empirical relevance and utility of coefficient alpha under certain empirical circumstances. The article highlights…

Descriptors: Test Validity, Test Reliability, Test Items, Correlation

Treatments of Differential Item Functioning: A Comparison of Four Methods

Peer reviewed

Liu, Xiaowen; Rogers, H. Jane – Educational and Psychological Measurement, 2022

Test fairness is critical to the validity of group comparisons involving gender, ethnicities, culture, or treatment conditions. Detection of differential item functioning (DIF) is one component of efforts to ensure test fairness. The current study compared four treatments for items that have been identified as showing DIF: deleting, ignoring,…

Descriptors: Item Analysis, Comparative Analysis, Culture Fair Tests, Test Validity

Are Speeded Tests Unfair? Modeling the Impact of Time Limits on the Gender Gap in Mathematics

Peer reviewed

Stoevenbelt, Andrea H.; Wicherts, Jelte M.; Flore, Paulette C.; Phillips, Lorraine A. T.; Pietschnig, Jakob; Verschuere, Bruno; Voracek, Martin; Schwabe, Inga – Educational and Psychological Measurement, 2023

When cognitive and educational tests are administered under time limits, tests may become speeded and this may affect the reliability and validity of the resulting test scores. Prior research has shown that time limits may create or enlarge gender gaps in cognitive and academic testing. On average, women complete fewer items than men when a test…

Descriptors: Timed Tests, Gender Differences, Item Response Theory, Correlation

The Total Score with Maximal Reliability and Maximal Criterion Validity: An Illustration Using a Career Satisfaction Measure

Peer reviewed

Fu, Yuanshu; Wen, Zhonglin; Wang, Yang – Educational and Psychological Measurement, 2018

The maximal reliability of a congeneric measure is achieved by weighting item scores to form the optimal linear combination as the total score; it is never lower than the composite reliability of the measure when measurement errors are uncorrelated. The statistical method that renders maximal reliability would also lead to maximal criterion…

Descriptors: Test Reliability, Test Validity, Comparative Analysis, Attitude Measures

Survey Satisficing Inflates Reliability and Validity Measures: An Experimental Comparison of College and Amazon Mechanical Turk Samples

Peer reviewed

Hamby, Tyler; Taylor, Wyn – Educational and Psychological Measurement, 2016

This study examined the predictors and psychometric outcomes of survey satisficing, wherein respondents provide quick, "good enough" answers (satisficing) rather than carefully considered answers (optimizing). We administered surveys to university students and respondents--half of whom held college degrees--from a for-pay survey website,…

Descriptors: Surveys, Test Reliability, Test Validity, Comparative Analysis

Reliability and Model Fit

Peer reviewed

Stanley, Leanne M.; Edwards, Michael C. – Educational and Psychological Measurement, 2016

The purpose of this article is to highlight the distinction between the reliability of test scores and the fit of psychometric measurement models, reminding readers why it is important to consider both when evaluating whether test scores are valid for a proposed interpretation and/or use. It is often the case that an investigator judges both the…

Descriptors: Test Reliability, Goodness of Fit, Scores, Patients

Improving the Factor Structure of Psychological Scales: The Expanded Format as an Alternative to the Likert Scale Format

Peer reviewed

Zhang, Xijuan; Savalei, Victoria – Educational and Psychological Measurement, 2016

Many psychological scales written in the Likert format include reverse worded (RW) items in order to control acquiescence bias. However, studies have shown that RW items often contaminate the factor structure of the scale by creating one or more method factors. The present study examines an alternative scale format, called the Expanded format,…

Descriptors: Factor Structure, Psychological Testing, Alternative Assessment, Test Items

Multigroup Generalizability Analysis of Verbal, Quantitative, and Nonverbal Ability Tests for Culturally and Linguistically Diverse Students

Peer reviewed

Lakin, Joni M.; Lai, Emily R. – Educational and Psychological Measurement, 2012

For educators seeking to differentiate instruction, cognitive ability tests sampling multiple content domains, including verbal, quantitative, and nonverbal reasoning, provide superior information about student strengths and weaknesses compared with unidimensional reasoning measures. However, these ability tests have not been fully evaluated with…

Descriptors: Aptitude Tests, Nonverbal Ability, Cognitive Ability, Verbal Ability

Investigating Halo and Ceiling Effects in Student Evaluations of Instruction

Peer reviewed

Keeley, Jared W.; English, Taylor; Irons, Jessica; Henslee, Amber M. – Educational and Psychological Measurement, 2013

Many measurement biases affect student evaluations of instruction (SEIs). However, two have been relatively understudied: halo effects and ceiling/floor effects. This study examined these effects in two ways. To examine the halo effect, using a videotaped lecture, we manipulated specific teacher behaviors to be "good" or "bad"…

Descriptors: Robustness (Statistics), Test Bias, Course Evaluation, Student Evaluation of Teacher Performance

A Comparison of Approaches for Improving the Reliability of Objective Level Scores

Peer reviewed

Skorupski, William P.; Carvajal, Jorge – Educational and Psychological Measurement, 2010

This study is an evaluation of the psychometric issues associated with estimating objective level scores, often referred to as "subscores." The article begins by introducing the concepts of reliability and validity for subscores from statewide achievement tests. These issues are discussed with reference to popular scaling techniques, classical…

Descriptors: Testing Programs, Test Validity, Achievement Tests, Scores

An Investigation of Calculator Use on Employment Tests of Mathematical Ability: Effects on Reliability, Validity, Test Scores, and Speed of Completion

Peer reviewed

Bing, Mark N.; Stewart, Susan M.; Davison, H. Kristl – Educational and Psychological Measurement, 2009

Handheld calculators have been used on the job for more than 30 years, yet the degree to which these devices can affect performance on employment tests of mathematical ability has not been thoroughly examined. This study used a within-subjects research design (N = 167) to investigate the effects of calculator use on test score reliability, test…

Descriptors: Calculators, Mathematics Tests, Occupational Tests, Test Reliability

Development of the Tempe Sorting Task: A Principled Approach to Assessment of Children's Executive Functioning

Peer reviewed

Marshall, Seth J.; Wodrich, David L.; Gorin, Joanna S. – Educational and Psychological Measurement, 2009

This study examined psychometric properties of the Tempe Sorting Task (TST), a new measure of executive function (EF) for children. To increase the meaningfulness of test score interpretations, an age-appropriate construct was employed to incorporate Denckla's description of EF. Multiple measures of EF, including the TST, were collected for…

Descriptors: Cognitive Tests, Cognitive Processes, Children, Attention Deficit Hyperactivity Disorder

Psychometric Properties of the Scores on the Behavioral Inhibition and Activation Scales in a Sample of Norwegian Children

Peer reviewed

Bjornebekk, Gunnar – Educational and Psychological Measurement, 2009

The primary aim of this study was to examine the psychometric properties of the scores on a version for children of the Carver and White Behavioral Inhibition and Activation scales (the BIS-BAS scales). This involved administering the BIS-BAS scales, the Positive and Negative Affect Schedule, the Junior Eysenck Personality Questionnaire…

Descriptors: Measures (Individuals), Psychometrics, Grade 6, Test Validity


Author

Michael, William B.	10
Powers, Stephen	7
Mehrabian, Albert	4
Parish, Thomas S.	4
Klein, Alice E.	3
Merritt, Sharon L.	3
Reynolds, William M.	3
Wise, Steven L.	3
Aiken, Lewis R.	2
Aleamoni, Lawrence M.	2
Baldauf, Richard B., Jr.	2
Burrell, Brenda	2
Erford, Bradley T.	2
Frary, Robert B.	2
Gable, Robert K.	2
Green, Kathy	2
Hambleton, Ronald K.	2
Hanna, Gerald S.	2
Knapp, Robert R.	2
Lewis, John	2
Loyd, Brenda H.	2
Marshall, Jon C.	2
Neely, Margery A.	2
Pascale, Pietro J.	2

Assessments and Surveys

Piers Harris Childrens Self…	6
Dimensions of Self Concept	5
Rotter Internal External…	4
Coopersmith Self Esteem…	3
Differential Aptitude Test	3
Wechsler Intelligence Scale…	3
Academic Self Concept Scale	2
Adjective Check List	2
Computer Attitude Scale	2
General Aptitude Test Battery	2
Learning Style Inventory	2
Maslach Burnout Inventory	2
Mathematics Anxiety Rating…	2
Metropolitan Achievement Tests	2
Minnesota Multiphasic…	2
Nowicki Strickland Locus of…	2
Rosenberg Self Esteem Scale	2
SAT (College Admission Test)	2
Slosson Intelligence Test	2
Wechsler Adult Intelligence…	2
Beck Depression Inventory	1
Behavior Assessment System…	1
California Achievement Tests	1
Career Development Inventory	1
Career Maturity Inventory	1