NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Researchers2
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 118 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Ferrari-Bridgers, Franca – International Journal of Listening, 2023
While many tools exist to assess student content knowledge, there are few that assess whether students display the critical listening skills necessary to interpret the quality of a speaker's message at the college level. The following research provides preliminary evidence for the internal consistency and factor structure of a tool, the…
Descriptors: Factor Structure, Test Validity, Community College Students, Test Reliability
Peer reviewed Peer reviewed
PDF on ERIC Download full text
David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023
We compared the influence of open-book extended duration versus closed book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30xfree-response short answer to a question, SAQ) and end-of-year paper (4xSAQ,…
Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format
Peer reviewed Peer reviewed
Direct linkDirect link
Slepkov, A. D.; Van Bussel, M. L.; Fitze, K. M.; Burr, W. S. – SAGE Open, 2021
There is a broad literature in multiple-choice test development, both in terms of item-writing guidelines, and psychometric functionality as a measurement tool. However, most of the published literature concerns multiple-choice testing in the context of expert-designed high-stakes standardized assessments, with little attention being paid to the…
Descriptors: Foreign Countries, Undergraduate Students, Student Evaluation, Multiple Choice Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Hartono, Wahyu; Hadi, Samsul; Rosnawati, Raden; Retnawati, Heri – Pegem Journal of Education and Instruction, 2023
Researchers design diagnostic assessments to measure students' knowledge structures and processing skills to provide information about their cognitive attribute. The purpose of this study is to determine the instrument's validity and score reliability, as well as to investigate the use of classical test theory to identify item characteristics. The…
Descriptors: Diagnostic Tests, Test Validity, Item Response Theory, Content Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Zijlmans, Eva A. O.; Tijmstra, Jesper; van der Ark, L. Andries; Sijtsma, Klaas – Educational and Psychological Measurement, 2018
Reliability is usually estimated for a total score, but it can also be estimated for item scores. Item-score reliability can be useful to assess the repeatability of an individual item score in a group. Three methods to estimate item-score reliability are discussed, known as method MS, method [lambda][subscript 6], and method CA. The item-score…
Descriptors: Test Items, Test Reliability, Correlation, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Papenberg, Martin; Musch, Jochen – Applied Measurement in Education, 2017
In multiple-choice tests, the quality of distractors may be more important than their number. We therefore examined the joint influence of distractor quality and quantity on test functioning by providing a sample of 5,793 participants with five parallel test sets consisting of items that differed in the number and quality of distractors.…
Descriptors: Multiple Choice Tests, Test Items, Test Validity, Test Reliability
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kim, Peter – Language Teaching Research Quarterly, 2021
Foreign language aptitude is defined as one's potential to learn a second language. A language learner with higher aptitude is predicted to learn more, faster, and reach a higher level of proficiency. If this is the case, one way to validate the construct of aptitude and its measure is to conduct a validation study in which measures of aptitude is…
Descriptors: Morphology (Languages), Syntax, Second Language Learning, Second Language Instruction
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Aktas, Meral Cansiz; Tabak, Sanem – European Journal of Educational Research, 2018
This research aims to complete Turkish adaptation, validity and reliability studies for the Math and Me Survey developed by Adelson and McCoach for use in determining the students' attitudes towards mathematics in the transition from primary school to middle school. Within the scope of validity and reliability studies for the scale, data gathered…
Descriptors: Foreign Countries, Test Construction, Test Validity, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Maxwell, Bruce; Boon, Helen; Tanchuk, Nicolas; Rauwerda, Bryan – Journal of Moral Education, 2021
This article documents the adaptation, piloting and validation of a measure of teachers' ethical sensitivity. To create the test, we modified a measure from dentistry drawing on literature in teacher professional ethics and drew on the expertise of professional ethics scholars and practitioners. Based on the results of Rasch analysis combined with…
Descriptors: Ethics, Moral Values, Scores, Teacher Education Programs
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Eryilmaz, Ali; Sapsaglam, Özkan – Journal of Education and Training Studies, 2018
Subjective well-being is a sign of positive mental health of children. The aim of the present study is to develop subjective well-being increasing strategies scale for children whose mothers' uses are varied 1 to 5. In this study, there were 195 mothers whose mean ages were 31, 49 and standard deviation were 4,71. Satisfaction with life, positive…
Descriptors: Well Being, Mothers, Foreign Countries, Young Children
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Çapan, Bahtiyar Eraslan; Bakioglu, Fuad – Universal Journal of Educational Research, 2016
In this study, reliability and validity are assessed for a Turkish culture adaptation of the Collective Moral Disengagement Scale for Adolescents. The study was carried out in two stages. In the first stage, translation, exploratory factor analysis, internal consistency coefficients, and test-retest method were performed; in the second stage,…
Descriptors: Foreign Countries, Adolescents, Measures (Individuals), Moral Values
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Tapsir, Ruzela; Nik Pa, Nik Azis; Zamri, Sharifah Norul Akmar Bt Syed – Malaysian Online Journal of Educational Sciences, 2018
Values in mathematics classroom is not commonly discussed, researched, implemented, and measured although value is a significant affective aspect of mathematics learning. In this article, it is proposed that an instrument is developed to measure the said values which will benefit the teaching and learning mathematics. Discussion will focus on the…
Descriptors: Test Reliability, Measurement Techniques, Mathematics Instruction, Item Analysis
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018
Testing in educational system perform a number of functions, the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that, testing is an important element of education. To effectively utilize the tests in educational policies and quality assurance its validity and…
Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Afacan, Senol; Cilden, Seyda – Journal of Education and Learning, 2018
This study was conducted for the purpose of developing a valid and reliable learning strategies scale for students receiving violin education in Departments of Music at Fine Arts High Schools. The scale was applied to 391 violin students receiving education in the 11th and 12th grades in Departments of Music at Fine Arts High Schools in the…
Descriptors: Learning Strategies, Music Education, Music Techniques, Musical Instruments
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8