Publication Date
In 2025 | 0 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 24 |
Since 2016 (last 10 years) | 80 |
Since 2006 (last 20 years) | 143 |
Descriptor
Correlation | 181 |
Test Items | 181 |
Test Validity | 118 |
Foreign Countries | 72 |
Test Reliability | 72 |
Test Construction | 65 |
Factor Analysis | 62 |
Scores | 48 |
Statistical Analysis | 46 |
Item Analysis | 43 |
Construct Validity | 37 |
More ▼ |
Source
Author
Liu, Ou Lydia | 4 |
Farina, Kristy | 3 |
Kobrin, Jennifer L. | 3 |
LaVenia, Mark | 3 |
Schoen, Robert C. | 3 |
Beglar, David | 2 |
Champagne, Zachary M. | 2 |
Dikmenli, Yurdal | 2 |
Mao, Liyang | 2 |
Marsh, Herbert W. | 2 |
Sackett, Paul R. | 2 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 5 |
Teachers | 2 |
Practitioners | 1 |
Location
Turkey | 19 |
Canada | 5 |
Germany | 5 |
California | 4 |
China | 3 |
Florida | 3 |
Indonesia | 3 |
Iran | 3 |
Japan | 3 |
United Kingdom | 3 |
Australia | 2 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
United Nations Convention on… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Hartono, Wahyu; Hadi, Samsul; Rosnawati, Raden; Retnawati, Heri – Pegem Journal of Education and Instruction, 2023
Researchers design diagnostic assessments to measure students' knowledge structures and processing skills to provide information about their cognitive attribute. The purpose of this study is to determine the instrument's validity and score reliability, as well as to investigate the use of classical test theory to identify item characteristics. The…
Descriptors: Diagnostic Tests, Test Validity, Item Response Theory, Content Validity
David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023
We compared the influence of open-book extended duration versus closed book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30xfree-response short answer to a question, SAQ) and end-of-year paper (4xSAQ,…
Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format
Luan, Lin; Liang, Jyh-Chong; Chai, Ching Sing; Lin, Tzu-Bin; Dong, Yan – Interactive Learning Environments, 2023
The emergence of new media technologies has empowered individuals to not merely consume but also create, share and critique media contents. Such activities are dependent on new media literacy (NML) necessary for living and working in the participatory culture of the twenty-first century. Although a burgeoning body of research has focused on the…
Descriptors: Foreign Countries, Media Literacy, Test Construction, English (Second Language)
Gill, Tim – Research Matters, 2022
In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…
Descriptors: Comparative Analysis, Decision Making, Scripts, Standards
An, Lily Shiao; Ho, Andrew Dean; Davis, Laurie Laughlin – Educational Measurement: Issues and Practice, 2022
Technical documentation for educational tests focuses primarily on properties of individual scores at single points in time. Reliability, standard errors of measurement, item parameter estimates, fit statistics, and linking constants are standard technical features that external stakeholders use to evaluate items and individual scale scores.…
Descriptors: Documentation, Scores, Evaluation Methods, Longitudinal Studies
Amber Dudley; Emma Marsden; Giulia Bovolenta – Language Testing, 2024
Vocabulary knowledge strongly predicts second language reading, listening, writing, and speaking. Yet, few tests have been developed to assess vocabulary knowledge in French. The primary aim of this pilot study was to design and initially validate the Context-Aligned Two Thousand Test (CA-TTT), following open research practices. The CA-TTT is a…
Descriptors: French, Vocabulary Development, Secondary School Students, Language Tests
Yoo Jeong Jang – ProQuest LLC, 2022
Despite the increasing demand for diagnostic information, observed subscores have been often reported to lack adequate psychometric qualities such as reliability, distinctiveness, and validity. Therefore, several statistical techniques based on CTT and IRT frameworks have been proposed to improve the quality of subscores. More recently, DCM has…
Descriptors: Classification, Accuracy, Item Response Theory, Correlation
Ferrari-Bridgers, Franca – International Journal of Listening, 2023
While many tools exist to assess student content knowledge, there are few that assess whether students display the critical listening skills necessary to interpret the quality of a speaker's message at the college level. The following research provides preliminary evidence for the internal consistency and factor structure of a tool, the…
Descriptors: Factor Structure, Test Validity, Community College Students, Test Reliability
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2019
This note discusses the merits of coefficient alpha and their conditions in light of recent critical publications that miss out on significant research findings over the past several decades. That earlier research has demonstrated the empirical relevance and utility of coefficient alpha under certain empirical circumstances. The article highlights…
Descriptors: Test Validity, Test Reliability, Test Items, Correlation
Hae In Park – English Teaching, 2024
The present study aimed to validate a 70-item Korean bilingual version of the Vocabulary Size Test (VST) using Rasch modeling. The goal was to assess the applicability of this Korean version of the VST for Korean learners of English in an English as a foreign language (EFL) context by examining validity evidence based on Messick's framework.…
Descriptors: Korean, Bilingualism, English (Second Language), Second Language Learning
Temel, Senar; Sen, Senol; Özcan, Özgür – Research in Science & Technological Education, 2018
Background: Determining individuals' views of the nature of science is quite important for researchers since it is both a component of scientific literacy and a fundamental aim of science education. Purpose: This study aims to develop a NOSvs for assessing prospective teachers' views of the nature of science and to analyse their psychometric…
Descriptors: Scientific Principles, Test Construction, Preservice Teachers, Student Teacher Attitudes
Erol, Ahmet; Yurdakal, Ibrahim Halil; Tekin Karagöz, Ceren – Malaysian Online Journal of Educational Technology, 2023
The "metaverse," which bridges augmented and virtual reality as mixed reality and includes technological phenomena such as artificial intelligence, continues to be an agenda topic. It is foreseen that the concept in question will accelerate the changes in education and teaching activities, as in many other fields. In this research, a…
Descriptors: Computer Simulation, Artificial Intelligence, Likert Scales, Preservice Teachers
Selcuk Acar; Denis Dumas; Peter Organisciak; Kelly Berthiaume – Grantee Submission, 2024
Creativity is highly valued in both education and the workforce, but assessing and developing creativity can be difficult without psychometrically robust and affordable tools. The open-ended nature of creativity assessments has made them difficult to score, expensive, often imprecise, and therefore impractical for school- or district-wide use. To…
Descriptors: Thinking Skills, Elementary School Students, Artificial Intelligence, Measurement Techniques
Akhtar, Hanif – International Association for Development of the Information Society, 2022
When examinees perceive a test as low stakes, it is logical to assume that some of them will not put out their maximum effort. This condition makes the validity of the test results more complicated. Although many studies have investigated motivational fluctuation across tests during a testing session, only a small number of studies have…
Descriptors: Intelligence Tests, Student Motivation, Test Validity, Student Attitudes
Wang, Yan; Kim, Eun Sook; Dedrick, Robert F.; Ferron, John M.; Tan, Tony – Educational and Psychological Measurement, 2018
Wording effects associated with positively and negatively worded items have been found in many scales. Such effects may threaten construct validity and introduce systematic bias in the interpretation of results. A variety of models have been applied to address wording effects, such as the correlated uniqueness model and the correlated traits and…
Descriptors: Test Items, Test Format, Correlation, Construct Validity