Usani Joseph Ofem; Valentine Joseph Owan; Cletus Ibout; Sylvai Victor Ovat – Pedagogical Research, 2025
This study employed repeated measures ANOVA to assess the reliability of an instrument designed to measure utilization, awareness, and perception of AI in research among 150 undergraduate students. Validated instruments with robust psychometric properties were used for the study. Data collection occurred in three phases spaced two weeks apart,…
Descriptors: Statistical Analysis, Test Reliability, Undergraduate Students, Attitude Measures
El Alaoui, Mohamed – IEEE Transactions on Learning Technologies, 2023
Classical evaluation methods, assessments, exams, and so forth accentuate the perception of one against all, professor versus learners. Including students in the assessment process transforms the professor from an opponent into a critical friend whose role is to help students recognize both their strengths and weaknesses. However,…
Descriptors: Peer Evaluation, Educational Improvement, Test Validity, Test Reliability
Wendy Castillo; Rachel Renbarger; Sasha Mejia-Bradford; Christen Priddie; Juan Cruz; Brein Mosely; Katherine Aragon – Annenberg Institute for School Reform at Brown University, 2025
Education policy research aimed at eliminating racism necessitates methodological innovation that fosters both equity-centered approaches and robust empirical analysis of the systemic nature of racism. Most quantitative research in educational psychology omits the racist environment that students in K-12 education exist in (DeCuir-Gunby &…
Descriptors: Racism, Elementary Secondary Education, Racial Discrimination, Surveys
Lyrica Lucas; Anum Khushal; Robert Mayes; Brian A. Couch; Joseph Dauer – International Journal of Science Education, 2025
Educational reform priorities such as emphasis on quantitative modelling (QM) have positioned undergraduate biology instructors as designers of QM experiences to engage students in authentic science practices that support the development of data-driven and evidence-based reasoning. Yet, little is known about how biology instructors adapt to the…
Descriptors: Undergraduate Students, College Science, Biology, Classroom Observation Techniques
de Jong, Peter F. – Journal of Psychoeducational Assessment, 2023
The Wechsler Intelligence Scale for Children--Fifth Edition (WISC-V; Wechsler, 2014) provides a general intelligence score, representing "g," and five index scores, reflecting underlying broad factors. Within-person differences between the overall performance across subtests and index scores, denoted as index difference scores, are often…
Descriptors: Test Validity, Children, Intelligence Tests, Indo European Languages
Sengül Avsar, Asiye – Measurement: Interdisciplinary Research and Perspectives, 2020
In order to reach valid and reliable test scores, various test theories have been developed, and one of them is nonparametric item response theory (NIRT). Mokken Models are the most widely known NIRT models which are useful for small samples and short tests. Mokken Package is useful for Mokken Scale Analysis. An important issue about validity is…
Descriptors: Response Style (Tests), Nonparametric Statistics, Item Response Theory, Test Validity
Karadavut, Tugba – Applied Measurement in Education, 2021
Mixture IRT models address the heterogeneity in a population by extracting latent classes and allowing item parameters to vary between latent classes. Once the latent classes are extracted, they need to be further examined to be characterized. Some approaches have been adopted in the literature for this purpose. These approaches examine either the…
Descriptors: Item Response Theory, Models, Test Items, Maximum Likelihood Statistics
Pentimonti, J.; Petscher, Y.; Stanley, C. – National Center on Improving Literacy, 2019
Sample representativeness is an important factor to consider when evaluating the quality of a screening assessment. To determine whether a screening tool accurately measures children's skills, you need to ensure that the sample used to validate the tool is representative of your population of interest.
Descriptors: Sampling, Screening Tests, Measurement, Test Validity
Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021
Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023
Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, etc.) of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…
Descriptors: Chemistry, Periodicals, Journal Articles, Science Education
Sørensen, Ole Henning; Bjørner, Jakob; Holtermann, Andreas; Dyreborg, Johnny; Sørli, Jorid Birkelund; Kristiansen, Jesper; Nielsen, Steffen Bohni – Research Evaluation, 2022
Research funders and policymakers increasingly focus on societal benefits of their investments in research. Research institutions thus face increasing pressure to demonstrate their societal impact to prove their legitimacy and worth. To this end, research institutions need reliable, quantitative methods to measure societal impact. This article…
Descriptors: Test Construction, Test Validity, Statistical Analysis, Outcome Measures
Knowles, Ryan T.; Hawkman, Andrea M. – Urban Review: Issues and Ideas in Public Education, 2020
This research article reports on the initial findings of a critical quantitative study, which developed and implemented a series of quantitative scales utilizing conceptualizations of racial fragility and anti-racist teacher self-efficacy scales. The scales were administered through a survey and yielded a usable sample of 4770 teachers in…
Descriptors: Statistical Analysis, Racial Discrimination, Self Efficacy, Teachers
Garcia-Garzon, Eduardo; Abad, Francisco J.; Garrido, Luis E. – Journal of Intelligence, 2019
There has been increased interest in assessing the quality and usefulness of short versions of the Raven's Progressive Matrices. A recent proposal, composed of the last twelve matrices of the Standard Progressive Matrices (SPM-LS), has been depicted as a valid measure of "g." Nonetheless, the results provided in the initial validation…
Descriptors: Intelligence Tests, Test Validity, Evaluation Methods, Undergraduate Students
Westbrook, Charles J.; Davis, Don E.; McElroy, Stacey E.; Brubaker, Kacy; Choe, Elise; Karaga, Sara; Dooley, Matt; O'Bryant, Brittany L.; Van Tongeren, Daryl R.; Hook, Joshua – Measurement and Evaluation in Counseling and Development, 2018
We develop the Trait Sources of Spirituality Scale (TSSS), which assesses experiences of closeness to the sacred, within and outside a religious tradition. After using factor analysis to finalize the scale, we examine evidence of construct validity, including latent profile analysis that reveals 5 patterns of how spirituality is experienced.
Descriptors: Measures (Individuals), Religious Factors, Factor Analysis, Test Construction
Chow, Peter; Chalmers, R. Philip; Flynn, Deborah M.; McLandress, Adam J.; Steadman, Victoria G. L. – College Student Journal, 2018
With the intent of amending the 21-item BDI-II to improve its reliability and validity when administering the scale to nonclinical populations, a survey package consisting of 19 positive items with semantically reflected response options to mirror the negative scenario options in the original BDI-II (excluding items 16 and 18) was created. These…
Descriptors: Depression (Psychology), Measures (Individuals), Test Reliability, Test Validity
Louis, Allison J.; Arora, Vineet M.; Matthiesen, Madeleine I.; Meltzer, David O.; Press, Valerie G. – Health Education & Behavior, 2017
As patient-centered education efforts increase, assessing health literacy (HL) becomes more salient. The verbal Brief Health Literacy Screen (BHLS) may have clinical and feasibility advantages over written tools, including the Rapid Estimate of Adult Literacy in Medicine--Revised (REALM-R) and Short Test of Functional Health Literacy in Adults…
Descriptors: Screening Tests, Hospitals, Literacy, Self Management

