ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	16

Descriptor

Test Reliability	25
Models	14
Test Validity	11
Evaluation Methods	6
Item Analysis	6
Measurement Techniques	6
Mathematical Models	5
Measurement	4
Simulation	4
Structural Equation Models	4
Test Construction	4
Test Items	4
Cognitive Processes	3
College Students	3
Computation	3
Computer Assisted Testing	3
Elementary Secondary Education	3
Error of Measurement	3
Evaluation Research	3
Feedback (Response)	3
Foreign Countries	3
Item Response Theory	3
Predictive Validity	3
Problem Solving	3
Psychometrics	3
More ▼

Source

Applied Psychological…	3
International Journal of…	2
Structural Equation Modeling:…	2
Assessment & Evaluation in…	1
Center for Education Data &…	1
Cognitive Science	1
Computer Assisted Language…	1
Educational Assessment	1
Educational and Psychological…	1
European Journal of…	1
Higher Education: The…	1
Journal of College Student…	1
Journal of Educational and…	1
Journal of Intelligence	1
Measurement:…	1
Online Submission	1
Physical Review Physics…	1
More ▼

Publication Type

Reports - Descriptive	25
Journal Articles	19
Opinion Papers	2
Speeches/Meeting Papers	2
Computer Programs	1
Numerical/Quantitative Data	1
Reports - Evaluative	1

Education Level

Higher Education	4
Elementary Secondary Education	3
Postsecondary Education	3

Audience

Researchers

Location

Russia	1
Texas (Austin)	1
United Kingdom	1
West Germany	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Graduate Record Examinations	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 25 results Save | Export

Evaluation of Maximal Reliability for Multidimensional Measuring Instruments Using Structural Equation Modeling

Peer reviewed

Direct link

Tenko Raykov; Bingsheng Zhang – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Multidimensional measuring instruments are often used in behavioral, social, educational, marketing, and biomedical research. For these scales, the paper discusses how to find the optimal score based on their components that is associated with the highest possible reliability. Within the framework of structural equation modeling, an approach to…

Descriptors: Multidimensional Scaling, Measurement Equipment, Measurement Techniques, Test Reliability

Evaluating the Discrepancy between Scale Reliability and Cronbach's Coefficient Alpha Using Latent Variable Modeling

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Measurement: Interdisciplinary Research and Perspectives, 2023

This article outlines a readily applicable procedure for point and interval estimation of the population discrepancy between reliability and the popular Cronbach's coefficient alpha for unidimensional multi-component measuring instruments with uncorrelated errors, which are widely used in behavioral and social research. The method is developed…

Descriptors: Measurement, Test Reliability, Measurement Techniques, Error of Measurement

A Measurement Is a Choice and Stevens' Scales of Measurement Do Not Help Make It: A Response to Chalmers

Peer reviewed

Direct link

Zumbo, Bruno D.; Kroc, Edward – Educational and Psychological Measurement, 2019

Chalmers recently published a critique of the use of ordinal a[alpha] proposed in Zumbo et al. as a measure of test reliability in certain research settings. In this response, we take up the task of refuting Chalmers' critique. We identify three broad misconceptions that characterize Chalmers' criticisms: (1) confusing assumptions with…

Descriptors: Test Reliability, Statistical Analysis, Misconceptions, Mathematical Models

The Precipitous Decline in Reasoning and Other Key Abilities with Age and Its Implications for Federal Judges

Peer reviewed
PDF on ERIC

Download full text

Kaufman, Alan S. – Journal of Intelligence, 2021

U.S. Supreme Court justices and other federal judges are, effectively, appointed for life, with no built-in check on their cognitive functioning as they approach old age. There is about a century of research on aging and intelligence that shows the vulnerability of processing speed, fluid reasoning, visual-spatial processing, and working memory to…

Descriptors: Judges, Federal Government, Aging (Individuals), Decision Making

Theoretical Model and Quantitative Assessment of Scientific Thinking and Reasoning

Peer reviewed

Direct link

Bao, Lei; Koenig, Kathleen; Xiao, Yang; Fritchman, Joseph; Zhou, Shaona; Chen, Cheng – Physical Review Physics Education Research, 2022

Abilities in scientific thinking and reasoning have been emphasized as core areas of initiatives, such as the Next Generation Science Standards or the College Board Standards for College Success in Science, which focus on the skills the future will demand of today's students. Although there is rich literature on studies of how these abilities…

Descriptors: Physics, Science Instruction, Teaching Methods, Thinking Skills

Measurement Validity and Reliability of Professional Pathways for Teachers: Research Brief. Publication 18.17 RB

Download full text

Hutchins, Shaun D. – Online Submission, 2019

The purpose of this Professional Pathways for Teachers (PPfT) evaluation was to examine the measurement validity and reliability of PPfT appraisal data from the 2017-2018 school year in the Austin Independent School District. The PPfT appraisal is a multi-measure system that covers three areas: instructional practices (IP), professional growth and…

Descriptors: Test Validity, Test Reliability, School Districts, Teacher Evaluation

Testing Methodology in the Student Learning Process

Peer reviewed
PDF on ERIC

Download full text

Gorbunova, Tatiana N. – European Journal of Contemporary Education, 2017

The subject of the research is to build methodologies to evaluate the student knowledge by testing. The author points to the importance of feedback about the mastering level in the learning process. Testing is considered as a tool. The object of the study is to create the test system models for defence practice problems. Special attention is paid…

Descriptors: Testing, Evaluation Methods, Feedback (Response), Simulation

In Praise of a Model but Not Its Conclusions: Commentary on Cooper, Catmur, and Heyes (2012)

Peer reviewed

Direct link

Bertenthal, Bennett I.; Scheutz, Matthias – Cognitive Science, 2013

Cooper et al. (this issue) develop an interactive activation model of spatial and imitative compatibilities that simulates the key results from Catmur and Heyes (2011) and thus conclude that both compatibilities are mediated by the same processes since their single model can predict all the results. Although the model is impressive, the…

Descriptors: Models, Test Validity, Test Reliability, Reader Response

Dynamic Problem Solving: A New Assessment Perspective

Peer reviewed

Direct link

Greiff, Samuel; Wustenberg, Sascha; Funke, Joachim – Applied Psychological Measurement, 2012

This article addresses two unsolved measurement issues in dynamic problem solving (DPS) research: (a) unsystematic construction of DPS tests making a comparison of results obtained in different studies difficult and (b) use of time-intensive single tasks leading to severe reliability problems. To solve these issues, the MicroDYN approach is…

Descriptors: Problem Solving, Tests, Measurement, Structural Equation Models

Use of the EFPA Test Review Model by the UK and Issues Relating to the Internationalization of Test Standards

Peer reviewed

Direct link

Lindley, Patricia A.; Bartram, Dave – International Journal of Testing, 2012

In this article, we present the background to the development of test reviewing by the British Psychological Society (BPS) in the United Kingdom. We also describe the role played by the BPS in the development of the EFPA test review model and its adaptation for use in test reviewing in the United Kingdom. We conclude with a discussion of lessons…

Descriptors: Test Reviews, Professional Associations, Psychology, Global Approach

Assessing the "Rothstein Falsification Test": Does It Really Show Teacher Value-Added Models Are Biased? CEDR Working Paper No. 2012 1.3

Direct link

Goldhaber, Dan; Chaplin, Duncan – Center for Education Data & Research, 2012

In a provocative and influential paper, Jesse Rothstein (2010) finds that standard value added models (VAMs) suggest implausible future teacher effects on past student achievement, a finding that obviously cannot be viewed as causal. This is the basis of a falsification test (the Rothstein falsification test) that appears to indicate bias in VAM…

Descriptors: School Effectiveness, Teacher Effectiveness, Achievement Gains, Statistical Bias

A Model for the Supervisor-Doctoral Student Relationship

Peer reviewed

Direct link

Mainhard, Tim; van der Rijst, Roeland; van Tartwijk, Jan; Wubbels, Theo – Higher Education: The International Journal of Higher Education and Educational Planning, 2009

The supervisor-doctoral student interpersonal relationship is important for the success of a PhD-project. Therefore, information about doctoral students' perceptions of their relationship with their supervisor can be useful for providing detailed feedback to supervisors aiming at improving the quality of their supervision. This paper describes the…

Descriptors: Feedback (Response), Student Attitudes, Test Validity, Measures (Individuals)

Multinomial and Compound Multinomial Error Models for Tests with Complex Item Scoring

Peer reviewed

Direct link

Lee, Won-Chan – Applied Psychological Measurement, 2007

This article introduces a multinomial error model, which models an examinee's test scores obtained over repeated measurements of an assessment that consists of polytomously scored items. A compound multinomial error model is also introduced for situations in which items are stratified according to content categories and/or prespecified numbers of…

Descriptors: Simulation, Error of Measurement, Scoring, Test Items

A Framework for Test Validity Research on Content Assessments Taken by English Language Learners

Peer reviewed

Direct link

Young, John W. – Educational Assessment, 2009

In this article, I specify a conceptual framework for test validity research on content assessments taken by English language learners (ELLs) in U.S. schools in grades K-12. This framework is modeled after one previously delineated by Willingham et al. (1988), which was developed to guide research on students with disabilities. In this framework…

Descriptors: Test Validity, Evaluation Research, Achievement Tests, Elementary Secondary Education

Estimation of Reliability for Multiple-Component Measuring Instruments in Hierarchical Designs

Peer reviewed

Direct link

Raykov, Tenko; du Toit, Stephen H. C. – Structural Equation Modeling: A Multidisciplinary Journal, 2005

A method for estimation of reliability for multiple-component measuring instruments with clustered data is outlined. The approach is applicable with hierarchical designs where individuals are nested within higher order units and exhibit possibly related performance on components of a scale of interest. The procedure is developed within the…

Descriptors: Structural Equation Models, Computation, Measurement Techniques, Test Reliability

Previous Page | Next Page »

Pages: 1 | 2

Raykov, Tenko	2
Bao, Lei	1
Bartram, Dave	1
Bertenthal, Bennett I.	1
Bingsheng Zhang	1
Bowles, Tyler J.	1
Burton, Richard F.	1
Chaplin, Duncan	1
Chen, Cheng	1
Cobern, William W.	1
Daud, Nuraihan Mat	1
Embretson, Susan E.	1
Feldt, Leonard S.	1
Foster, Jeff L.	1
Fritchman, Joseph	1
Funke, Joachim	1
Goldhaber, Dan	1
Gorbunova, Tatiana N.	1
Gorin, Joanna S.	1
Greiff, Samuel	1
Hutchins, Shaun D.	1
Jones, Jason	1
Kaufman, Alan S.	1
Koenig, Kathleen	1
Kroc, Edward	1
More ▼