Metsämuuronen, Jari – International Journal of Educational Methodology, 2020
Pearson product-moment correlation coefficient between item g and test score X, known as item-test or item-total correlation ("Rit"), and item-rest correlation ("Rir") are two of the most used classical estimators for item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…
Descriptors: Correlation, Test Items, Scores, Difficulty Level
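As a rough illustration of the two classical estimators named in the abstract above, the sketch below computes the item-total correlation (Rit) and the item-rest correlation (Rir) for one item of a small, made-up response matrix. The data, the function names `pearson` and `item_discrimination`, and the 0/1 scoring are assumptions for illustration only. This is not the method proposed in the cited article, only the textbook definitions it starts from.

```python
from math import sqrt


def pearson(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def item_discrimination(responses, g):
    """Return (Rit, Rir) for item index g.

    responses: list of rows, one per test taker, each a list of item scores.
    Rit correlates item g with the total score X (item included).
    Rir correlates item g with the rest score (total minus item g).
    """
    item = [row[g] for row in responses]
    total = [sum(row) for row in responses]
    rest = [t - i for t, i in zip(total, item)]
    return pearson(item, total), pearson(item, rest)


# Hypothetical data: 6 test takers, 4 dichotomously scored items.
responses = [
    [1, 1, 1, 0],
    [1, 0, 1, 1],
    [0, 0, 1, 0],
    [1, 1, 0, 1],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
]

rit, rir = item_discrimination(responses, 0)
print(f"Rit = {rit:.3f}, Rir = {rir:.3f}")
```

Because the item contributes to its own total score, Rit comes out higher than Rir on these data; removing the item from the total is what distinguishes the two estimators.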
Xiao, Leifeng; Hau, Kit-Tai – Educational and Psychological Measurement, 2023
We examined the performance of coefficient alpha and its potential competitors (ordinal alpha, omega total, Revelle's omega total [omega RT], omega hierarchical [omega h], greatest lower bound [GLB], and coefficient "H") with continuous and discrete data having different types of non-normality. Results showed the estimation bias was…
Descriptors: Statistical Bias, Statistical Analysis, Likert Scales, Statistical Distributions
Merchant, Stefan; Rich, Jessica; Klinger, Don A. – Canadian Journal of Educational Administration and Policy, 2022
Both school and district administrators use the results of standardized, large-scale tests to inform decisions about the need for, or success of, educational programs and interventions. However, test results at the school level are subject to random fluctuations due to changes in cohort, test items, and other factors outside of the school's…
Descriptors: Standardized Tests, Foreign Countries, Generalizability Theory, Scores
Kelly, William E.; Daughtry, Don – College Student Journal, 2018
This study developed an abbreviated form of Barron's (1953) Ego Strength Scale for use in research among college student samples. A version of Barron's scale was administered to 100 undergraduate college students. Using item-total score correlations and internal consistency, the scale was reduced to 18 items (Es18). The Es18 possessed adequate…
Descriptors: Undergraduate Students, Self Concept Measures, Test Length, Scores
Chow, Peter; Chalmers, R. Philip; Flynn, Deborah M.; McLandress, Adam J.; Steadman, Victoria G. L. – College Student Journal, 2018
With the intent of amending the 21-item BDI-II to improve its reliability and validity when administering the scale to nonclinical populations, a survey package consisting of 19 positive items with semantically reflected response options to mirror the negative scenario options in the original BDI-II (excluding items 16 and 18) was created. These…
Descriptors: Depression (Psychology), Measures (Individuals), Test Reliability, Test Validity
Trierweiler, Tammy J.; Lewis, Charles; Smith, Robert L. – Journal of Educational Measurement, 2016
In this study, we describe what factors influence the observed score correlation between an (external) anchor test and a total test. We show that the anchor to full-test observed score correlation is based on two components: the true score correlation between the anchor and total test, and the reliability of the anchor test. Findings using an…
Descriptors: Scores, Correlation, Tests, Test Reliability
Andersson, Björn; Xin, Tao – Educational and Psychological Measurement, 2018
In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…
Descriptors: Item Response Theory, Test Reliability, Test Items, Scores
Tan, Shiu Kuan; Chellappan, Kalaivani – Measurement and Evaluation in Counseling and Development, 2018
This study investigated the validity and reliability of scores on the instrument employing Rasch analysis in a sample of 299 Malaysian adolescents aged between 16 and 19 and provided further evidence for the validity among the sub-constructs: social self-efficacy, academic self-efficacy, and emotional self-efficacy.
Descriptors: Test Validity, Test Reliability, Self Efficacy, Questionnaires
Mantzicopoulos, Panayota; French, Brian F.; Patrick, Helen; Watson, J. Samuel; Ahn, Inok – Educational Assessment, 2018
To meet recent accountability mandates, school districts are implementing assessment frameworks to document teachers' effectiveness. Observational assessments play a key role in this process, albeit without compelling evidence of their psychometric rigor. Using a sample of kindergarten teachers, we employed Generalizability theory to investigate…
Descriptors: Preschool Teachers, Kindergarten, Teacher Effectiveness, Generalizability Theory
Minter, Anthony; Pritzker, Suzanne – Research on Social Work Practice, 2017
Objective: This study examines the psychometric strength, including cross-ethnic validity, of two subscales of Muris' Self-Efficacy Questionnaire for Children: Academic Self-Efficacy (ASE) and Social Self-Efficacy (SSE). Methods: A large ethnically diverse sample of 3,358 early and late adolescents completed surveys including the ASE and SSE.…
Descriptors: Questionnaires, Self Efficacy, Psychometrics, Test Validity
Topcu, Çigdem; Erdur-Baker, Özgür – Measurement and Evaluation in Counseling and Development, 2018
The aim of this study is to update the Turkish version of the Revised Cyber Bullying Inventory (RCBI) and eliminate specific technology names. Validity and reliability tests were carried out with 1,803 high school students. The updated version of the RCBI yields valid and reliable scores measuring cyberbullying and victimization.
Descriptors: Foreign Countries, High School Students, Bullying, Computer Mediated Communication
Rae, James R.; Olson, Kristina R. – Developmental Psychology, 2018
The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many…
Descriptors: Pretests Posttests, Test Reliability, Predictive Validity, Association Measures
McRae, Lamerial; Gonzalez, Jennifer E.; Dominguez, Vanessa; Daire, Andrew Patrick; Liu, Xun – Measurement and Evaluation in Counseling and Development, 2018
We examined the construction for a modified Acceptance of Couple Violence (ACV) scale administered to lesbian, gay, bisexual, transgender, and queer college students (N = 266) measuring intimate partner violence. We ran an exploratory and confirmatory factor analysis; results identified 1 factor for the instrument explaining 76% of the variance in…
Descriptors: Factor Analysis, Test Construction, Questionnaires, Measures (Individuals)
Lambert, Matthew C.; January, Stacy-Ann A.; Pierce, Corey D. – Journal of Psychoeducational Assessment, 2018
The Emotional and Behavioral Screener (EBS) is a recently developed teacher-reported brief screening instrument for identifying students who are at-risk of an emotional or behavioral disorder (EBD). Although prior research supports the technical adequacy of scores from the EBS, there is a gap in the literature regarding strong evidence of the…
Descriptors: Screening Tests, Scores, Emotional Disturbances, Behavior Disorders
Westrick, Paul A. – Educational Assessment, 2017
Undergraduate grade point average (GPA) is a commonly employed measure in educational research, serving as a criterion or as a predictor depending on the research question. Over the decades, researchers have used a variety of reliability coefficients to estimate the reliability of undergraduate GPA, which suggests that there has been no consensus…
Descriptors: Undergraduate Students, Test Reliability, College Entrance Examinations, Longitudinal Studies