Akhtar, Hanif – International Association for Development of the Information Society, 2022
When examinees perceive a test as low stakes, it is logical to assume that some of them will not put forth their maximum effort. This complicates the validity of the test results. Although many studies have investigated motivational fluctuation across tests during a testing session, only a small number of studies have…
Descriptors: Intelligence Tests, Student Motivation, Test Validity, Student Attitudes
Zolfaghari, Maryam; Kosko, Karl W.; Austin, Christine K. – North American Chapter of the International Group for the Psychology of Mathematics Education, 2022
This study presents an extension of the validity argument for the PCK-Fractions measure. PCK-Fractions is designed to assess the effectiveness of professional experiences in facilitating teachers' pedagogical content knowledge (PCK) for children's fraction reasoning in grades 3-5. We examined data across 101 participants from two Midwest…
Descriptors: Fractions, Mathematics Instruction, Pedagogical Content Knowledge, Validity
Dumas, Denis; McNeish, Daniel; Schreiber-Gregory, Deanna; Durning, Steven J.; Torre, Dario – AERA Online Paper Repository, 2020
Dynamic Measurement Modeling (DMM) is a psychometric paradigm that uses longitudinal data to estimate students' capacity to learn over the course of an educational program (i.e., growth scores). Here, we provide justification for this approach in health professions education and demonstrate its proof of concept with three time-points of USMLE Step…
Descriptors: Allied Health Occupations Education, Measurement Techniques, Psychometrics, Longitudinal Studies
Ham, Yeajin; Hwang, Jihyun – AERA Online Paper Repository, 2020
With the rising importance of collaborative problem-solving skills (CLPS) for the new information age, PISA 2015 assessed students' CLPS on a large scale. This new attempt to measure CLPS raises an issue of validity. The purpose of this research is to examine evidence for convergent validity of CLPS through investigating the relationships…
Descriptors: Cooperative Learning, Problem Solving, Student Evaluation, International Assessment
Zhang, Xiuyuan – AERA Online Paper Repository, 2019
The main purpose of the study is to evaluate the qualities of human essay ratings for a large-scale assessment using Rasch measurement theory. Specifically, Many-Facet Rasch Measurement (MFRM) was utilized to examine the rating scale category structure and provide important information about interpretations of ratings in the large-scale…
Descriptors: Essays, Evaluators, Writing Evaluation, Reliability
Ji-Eun Lee; Amisha Jindal; Sanika Nitin Patki; Ashish Gurung; Reilly Norum; Erin Ottmar – Grantee Submission, 2022
This paper demonstrates how to apply Machine Learning (ML) techniques to analyze student interaction data collected in an online mathematics game. We examined: (1) how different ML algorithms influenced the precision of middle-school students' (N = 359) performance prediction; and (2) what types of in-game features were associated with student…
Descriptors: Teaching Methods, Algorithms, Mathematics Tests, Computer Games
Peoples, Shelagh M.; Flanagan, Kathleen Marie; Foster, Brandon – AERA Online Paper Repository, 2017
High school graduation is "not yet a reliable indicator of college readiness" (Gaertner & McClarty, 2015, p. 2). As such, researchers are investigating the use of non-cognitive factors as predictors of college and career readiness (CCR). The College and Career Readiness English Language Arts (ELA) Scale was designed to measure…
Descriptors: English, Language Arts, Self Efficacy, At Risk Students
Ausin, Markel Sanz; Azizsoltani, Hamoon; Barnes, Tiffany; Chi, Min – International Educational Data Mining Society, 2019
Deep Reinforcement Learning (DRL) has been shown to be a very powerful technique in recent years on a wide range of applications. Much of the prior DRL work took the "online" learning approach. However, given the challenges of building accurate simulations for modeling student learning, we investigated applying DRL to induce a…
Descriptors: Reinforcement, Intelligent Tutoring Systems, Teaching Methods, Instructional Effectiveness
Lev, Sagit; Ayalon, Liat – Research on Social Work Practice, 2018
Objective: To describe the quantitative validation of a unique questionnaire to measure moral distress among social workers in long-term care facilities in Israel. Method: Overall, 216 long-term care facilities' social workers took part in the pilot study that included psychometric evaluation and construct validation. Moral distress was examined…
Descriptors: Moral Values, Social Work, Emotional Disturbances, Intervention
Hutt, Stephen; Gardner, Margo; Duckworth, Angela L.; D'Mello, Sidney K. – International Educational Data Mining Society, 2019
We explore generalizability and fairness across sociodemographic groups for predicting on-time college graduation using a national dataset of 41,359 college applications. Our features include sociodemographics, institutional graduation rates, academic achievement, standardized test scores, engagement in extracurricular activities, and work…
Descriptors: Generalization, Predictive Measurement, College Applicants, Time to Degree
Dogan, Ozgur; Savas, Seyfi; Zorlular, Ali – Online Submission, 2018
The core area is made up of muscles that surround the human body like a corset and act to stabilize the body. Core stabilization training can strengthen the muscles in this area and provide better stabilization. The purpose of this study is to investigate the effect of 8 weeks of core stabilization training on the FMS (Functional Movement…
Descriptors: Physiology, Team Sports, Muscular Strength, Physical Fitness
Lambert, Richard G.; Kim, Do-Hong; Burts, Diane C. – AERA Online Paper Repository, 2016
Associations between scale scores obtained from a teacher observation-based authentic assessment measure, the "Teaching Strategies GOLD®", and (a) teacher ratings of children's social functioning and learning behaviors, and (b) child performance on external, individually administered direct assessments of academic skills are presented.…
Descriptors: Performance Based Assessment, Preschool Teachers, Preschool Children, Kindergarten
Adaptation of Scientific Reasoning Scale into Turkish and Examination of Its Psychometric Properties
Muslu Kaygisiz, Gülfem; Gürkan, Burcu; Akbas, Ufuk – Educational Sciences: Theory and Practice, 2018
This study aims to adapt the Scientific Reasoning Scale (SRS) into Turkish. The translated form was given to students enrolled at different levels, together with a form in which they were asked to state what they had understood and the reasons for their responses. It was seen that the explanations of students to one item…
Descriptors: Foreign Countries, Science Process Skills, Science Tests, Psychometrics
Benton, Tom – Cambridge Assessment, 2016
The reliability of an assessment is defined as the extent to which candidates' results would remain stable if the entire assessment exercise was repeated. Whilst numerous studies have evaluated the reliability of written examinations, relatively little has been done to quantify the reliability of internal teacher assessment within schools. This is…
Descriptors: Test Reliability, Foreign Countries, History Instruction, English Literature
Beard, Jonathan; Jagesic, Sanja – AERA Online Paper Repository, 2017
Validity evidence to support the use of exam scores for admission to postsecondary institutions is generally provided in the form of correlation coefficients. The measures used to establish the correlations are scores on a particular entrance exam and most typically a student's first-year college grade point average (FYGPA). Correlations…
Descriptors: College Admission, Validity, Scores, College Entrance Examinations