Raykov, Tenko; Zhang, Bingsheng – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Multidimensional measuring instruments are often used in behavioral, social, educational, marketing, and biomedical research. For these scales, the paper discusses how to find the optimal score based on their components that is associated with the highest possible reliability. Within the framework of structural equation modeling, an approach to…
Descriptors: Multidimensional Scaling, Measurement Equipment, Measurement Techniques, Test Reliability
Raykov, Tenko; Marcoulides, George A. – Measurement: Interdisciplinary Research and Perspectives, 2023
This article outlines a readily applicable procedure for point and interval estimation of the population discrepancy between reliability and the popular Cronbach's coefficient alpha for unidimensional multi-component measuring instruments with uncorrelated errors, which are widely used in behavioral and social research. The method is developed…
Descriptors: Measurement, Test Reliability, Measurement Techniques, Error of Measurement
Zumbo, Bruno D.; Kroc, Edward – Educational and Psychological Measurement, 2019
Chalmers recently published a critique of the use of ordinal alpha proposed in Zumbo et al. as a measure of test reliability in certain research settings. In this response, we take up the task of refuting Chalmers' critique. We identify three broad misconceptions that characterize Chalmers' criticisms: (1) confusing assumptions with…
Descriptors: Test Reliability, Statistical Analysis, Misconceptions, Mathematical Models
Kaufman, Alan S. – Journal of Intelligence, 2021
U.S. Supreme Court justices and other federal judges are, effectively, appointed for life, with no built-in check on their cognitive functioning as they approach old age. There is about a century of research on aging and intelligence that shows the vulnerability of processing speed, fluid reasoning, visual-spatial processing, and working memory to…
Descriptors: Judges, Federal Government, Aging (Individuals), Decision Making
Bao, Lei; Koenig, Kathleen; Xiao, Yang; Fritchman, Joseph; Zhou, Shaona; Chen, Cheng – Physical Review Physics Education Research, 2022
Abilities in scientific thinking and reasoning have been emphasized as core areas of initiatives, such as the Next Generation Science Standards or the College Board Standards for College Success in Science, which focus on the skills the future will demand of today's students. Although there is rich literature on studies of how these abilities…
Descriptors: Physics, Science Instruction, Teaching Methods, Thinking Skills
Hutchins, Shaun D. – Online Submission, 2019
The purpose of this Professional Pathways for Teachers (PPfT) evaluation was to examine the measurement validity and reliability of PPfT appraisal data from the 2017-2018 school year in the Austin Independent School District. The PPfT appraisal is a multi-measure system that covers three areas: instructional practices (IP), professional growth and…
Descriptors: Test Validity, Test Reliability, School Districts, Teacher Evaluation
Gorbunova, Tatiana N. – European Journal of Contemporary Education, 2017
The research aims to build methodologies for evaluating student knowledge by testing. The author points to the importance of feedback about the level of mastery in the learning process. Testing is considered as a tool. The object of the study is to create test system models for defence practice problems. Special attention is paid…
Descriptors: Testing, Evaluation Methods, Feedback (Response), Simulation
Bertenthal, Bennett I.; Scheutz, Matthias – Cognitive Science, 2013
Cooper et al. (this issue) develop an interactive activation model of spatial and imitative compatibilities that simulates the key results from Catmur and Heyes (2011) and thus conclude that both compatibilities are mediated by the same processes since their single model can predict all the results. Although the model is impressive, the…
Descriptors: Models, Test Validity, Test Reliability, Reader Response
Greiff, Samuel; Wustenberg, Sascha; Funke, Joachim – Applied Psychological Measurement, 2012
This article addresses two unsolved measurement issues in dynamic problem solving (DPS) research: (a) unsystematic construction of DPS tests making a comparison of results obtained in different studies difficult and (b) use of time-intensive single tasks leading to severe reliability problems. To solve these issues, the MicroDYN approach is…
Descriptors: Problem Solving, Tests, Measurement, Structural Equation Models
Lindley, Patricia A.; Bartram, Dave – International Journal of Testing, 2012
In this article, we present the background to the development of test reviewing by the British Psychological Society (BPS) in the United Kingdom. We also describe the role played by the BPS in the development of the EFPA test review model and its adaptation for use in test reviewing in the United Kingdom. We conclude with a discussion of lessons…
Descriptors: Test Reviews, Professional Associations, Psychology, Global Approach
Goldhaber, Dan; Chaplin, Duncan – Center for Education Data & Research, 2012
In a provocative and influential paper, Jesse Rothstein (2010) finds that standard value added models (VAMs) suggest implausible future teacher effects on past student achievement, a finding that obviously cannot be viewed as causal. This is the basis of a falsification test (the Rothstein falsification test) that appears to indicate bias in VAM…
Descriptors: School Effectiveness, Teacher Effectiveness, Achievement Gains, Statistical Bias
Mainhard, Tim; van der Rijst, Roeland; van Tartwijk, Jan; Wubbels, Theo – Higher Education: The International Journal of Higher Education and Educational Planning, 2009
The supervisor-doctoral student interpersonal relationship is important for the success of a PhD-project. Therefore, information about doctoral students' perceptions of their relationship with their supervisor can be useful for providing detailed feedback to supervisors aiming at improving the quality of their supervision. This paper describes the…
Descriptors: Feedback (Response), Student Attitudes, Test Validity, Measures (Individuals)
Lee, Won-Chan – Applied Psychological Measurement, 2007
This article introduces a multinomial error model, which models an examinee's test scores obtained over repeated measurements of an assessment that consists of polytomously scored items. A compound multinomial error model is also introduced for situations in which items are stratified according to content categories and/or prespecified numbers of…
Descriptors: Simulation, Error of Measurement, Scoring, Test Items
Young, John W. – Educational Assessment, 2009
In this article, I specify a conceptual framework for test validity research on content assessments taken by English language learners (ELLs) in U.S. schools in grades K-12. This framework is modeled after one previously delineated by Willingham et al. (1988), which was developed to guide research on students with disabilities. In this framework…
Descriptors: Test Validity, Evaluation Research, Achievement Tests, Elementary Secondary Education
Raykov, Tenko; du Toit, Stephen H. C. – Structural Equation Modeling: A Multidisciplinary Journal, 2005
A method for estimation of reliability for multiple-component measuring instruments with clustered data is outlined. The approach is applicable with hierarchical designs where individuals are nested within higher order units and exhibit possibly related performance on components of a scale of interest. The procedure is developed within the…
Descriptors: Structural Equation Models, Computation, Measurement Techniques, Test Reliability