Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 10 |
Since 2016 (last 10 years) | 29 |
Since 2006 (last 20 years) | 73 |
Descriptor
Correlation | 115 |
Item Analysis | 115 |
Test Validity | 115 |
Test Reliability | 69 |
Factor Analysis | 48 |
Foreign Countries | 46 |
Test Construction | 39 |
Psychometrics | 31 |
Test Items | 29 |
Statistical Analysis | 23 |
Questionnaires | 21 |
More ▼ |
Source
Author
Abrams, Lisa M. | 2 |
Conroy, Maureen A. | 2 |
Klein, Stephen P. | 2 |
McCallum, R. Steve | 2 |
McLeod, Bryce D. | 2 |
Smith, Meghan M. | 2 |
Sutherland, Kevin S. | 2 |
Afacan, Senol | 1 |
Akbaba, Sirri | 1 |
Akhtar, Hanif | 1 |
Aktas, Meral Cansiz | 1 |
More ▼ |
Publication Type
Reports - Research | 87 |
Journal Articles | 73 |
Speeches/Meeting Papers | 9 |
Reports - Evaluative | 8 |
Tests/Questionnaires | 8 |
Dissertations/Theses -… | 2 |
Information Analyses | 1 |
Numerical/Quantitative Data | 1 |
Reports - Descriptive | 1 |
Education Level
Audience
Researchers | 4 |
Location
Turkey | 17 |
Indonesia | 3 |
Iran | 3 |
Singapore | 3 |
Canada | 2 |
China | 2 |
Greece | 2 |
California | 1 |
China (Beijing) | 1 |
Colombia | 1 |
Florida | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Yoo Jeong Jang – ProQuest LLC, 2022
Despite the increasing demand for diagnostic information, observed subscores have been often reported to lack adequate psychometric qualities such as reliability, distinctiveness, and validity. Therefore, several statistical techniques based on CTT and IRT frameworks have been proposed to improve the quality of subscores. More recently, DCM has…
Descriptors: Classification, Accuracy, Item Response Theory, Correlation
Ferrari-Bridgers, Franca – International Journal of Listening, 2023
While many tools exist to assess student content knowledge, there are few that assess whether students display the critical listening skills necessary to interpret the quality of a speaker's message at the college level. The following research provides preliminary evidence for the internal consistency and factor structure of a tool, the…
Descriptors: Factor Structure, Test Validity, Community College Students, Test Reliability
David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023
We compared the influence of open-book extended duration versus closed book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30xfree-response short answer to a question, SAQ) and end-of-year paper (4xSAQ,…
Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format
Hartono, Wahyu; Hadi, Samsul; Rosnawati, Raden; Retnawati, Heri – Pegem Journal of Education and Instruction, 2023
Researchers design diagnostic assessments to measure students' knowledge structures and processing skills to provide information about their cognitive attribute. The purpose of this study is to determine the instrument's validity and score reliability, as well as to investigate the use of classical test theory to identify item characteristics. The…
Descriptors: Diagnostic Tests, Test Validity, Item Response Theory, Content Validity
Akhtar, Hanif – International Association for Development of the Information Society, 2022
When examinees perceive a test as low stakes, it is logical to assume that some of them will not put out their maximum effort. This condition makes the validity of the test results more complicated. Although many studies have investigated motivational fluctuation across tests during a testing session, only a small number of studies have…
Descriptors: Intelligence Tests, Student Motivation, Test Validity, Student Attitudes
Boori, Ali Akbar; Ghazanfari, Mohammad; Ghonsooly, Behzad; Baghaei, Purya – International Journal of Language Testing, 2023
Cognitive diagnostic models (CDMs) have received sustained attention in educational settings because they can be used to operationalize formative assessment to provide diagnostic feedback and inform instruction. A large number of CDMs have been developed over the past few years. An important component of all CDMs is a Q-matrix that specifies a…
Descriptors: Reading Comprehension, Reading Tests, English (Second Language), Islam
Pools, Elodie; Monseur, Christian – Large-scale Assessments in Education, 2021
Background: The idea of using low-stakes assessment results is often mentioned when designing educational system reforms. However, when tests have no consequences for the students, test takers may not make enough effort when completing the test, and their lack of engagement may negatively affect the validity of the conclusions of the studies that…
Descriptors: Science Tests, Test Validity, Student Motivation, Learner Engagement
Kosko Karl W.; Singh, Rashmi – Journal of Mathematics Education at Teachers College, 2018
Multiplicative reasoning is a key concept in elementary school mathematics. Item statistics reported by the National Assessment of Educational Progress (NAEP) assessment provide the best current indicator for how well elementary students across the U.S. understand this, and other concepts. However, beyond expert reviews and statistical analysis,…
Descriptors: Elementary School Students, Grade 4, Numeracy, Mathematics Tests
Jamalzadeh, Mehri; Lotfi, Ahmad Reza; Rostami, Masoud – Language Testing in Asia, 2021
The current study sought to examine the validity of a General English Achievement Test (GEAT), administered to university students in the fall semester of 2018-2019 academic year, by hybridizing differential information (DIF) and differential distractor function (DDF) analytical models. Using a purposive sampling method, from the target population…
Descriptors: Language Tests, Achievement Tests, Undergraduate Students, Islam
Papenberg, Martin; Musch, Jochen – Applied Measurement in Education, 2017
In multiple-choice tests, the quality of distractors may be more important than their number. We therefore examined the joint influence of distractor quality and quantity on test functioning by providing a sample of 5,793 participants with five parallel test sets consisting of items that differed in the number and quality of distractors.…
Descriptors: Multiple Choice Tests, Test Items, Test Validity, Test Reliability
Kim, Peter – Language Teaching Research Quarterly, 2021
Foreign language aptitude is defined as one's potential to learn a second language. A language learner with higher aptitude is predicted to learn more, faster, and reach a higher level of proficiency. If this is the case, one way to validate the construct of aptitude and its measure is to conduct a validation study in which measures of aptitude is…
Descriptors: Morphology (Languages), Syntax, Second Language Learning, Second Language Instruction
Aktas, Meral Cansiz; Tabak, Sanem – European Journal of Educational Research, 2018
This research aims to complete Turkish adaptation, validity and reliability studies for the Math and Me Survey developed by Adelson and McCoach for use in determining the students' attitudes towards mathematics in the transition from primary school to middle school. Within the scope of validity and reliability studies for the scale, data gathered…
Descriptors: Foreign Countries, Test Construction, Test Validity, Test Reliability
Maxwell, Bruce; Boon, Helen; Tanchuk, Nicolas; Rauwerda, Bryan – Journal of Moral Education, 2021
This article documents the adaptation, piloting and validation of a measure of teachers' ethical sensitivity. To create the test, we modified a measure from dentistry drawing on literature in teacher professional ethics and drew on the expertise of professional ethics scholars and practitioners. Based on the results of Rasch analysis combined with…
Descriptors: Ethics, Moral Values, Scores, Teacher Education Programs
Eryilmaz, Ali; Sapsaglam, Özkan – Journal of Education and Training Studies, 2018
Subjective well-being is a sign of positive mental health of children. The aim of the present study is to develop subjective well-being increasing strategies scale for children whose mothers' uses are varied 1 to 5. In this study, there were 195 mothers whose mean ages were 31, 49 and standard deviation were 4,71. Satisfaction with life, positive…
Descriptors: Well Being, Mothers, Foreign Countries, Young Children
Çapan, Bahtiyar Eraslan; Bakioglu, Fuad – Universal Journal of Educational Research, 2016
In this study, reliability and validity are assessed for a Turkish culture adaptation of the Collective Moral Disengagement Scale for Adolescents. The study was carried out in two stages. In the first stage, translation, exploratory factor analysis, internal consistency coefficients, and test-retest method were performed; in the second stage,…
Descriptors: Foreign Countries, Adolescents, Measures (Individuals), Moral Values