Showing 1 to 15 of 1,234 results
Peer reviewed
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Peer reviewed
Ehri Ryu – Society for Research on Educational Effectiveness, 2024
Background/Context: Confirmatory factor analysis (CFA) model is a commonly adopted framework to estimate and test a measurement model. Once a well-fitting final CFA model is selected, the selected model may be used to test structural relationships of the latent constructs with other variables, to construct a test with desired reliability and…
Descriptors: Research Problems, Factor Analysis, Scores, Computation
Peer reviewed
Viola Merhof; Caroline M. Böhm; Thorsten Meiser – Educational and Psychological Measurement, 2024
Item response tree (IRTree) models are a flexible framework to control self-reported trait measurements for response styles. To this end, IRTree models decompose the responses to rating items into sub-decisions, which are assumed to be made on the basis of either the trait being measured or a response style, whereby the effects of such person…
Descriptors: Item Response Theory, Test Interpretation, Test Reliability, Test Validity
Peer reviewed
Muhammed Tayyib Kadak; Nihal Serdengeçti; Meryem Seçen Yazici; Tuncay Sandikçi; Aybike Aydin; Zehra Koyuncu; Yavuz Meral; Abas Hasimoglu; Yasin Çaliskan; Gizem Bayraktar; Elif Can Öztürk; Mehmet Enes Gökler; Roula Choueiri; Mahmut Cem Tarakçioglu – Autism: The International Journal of Research and Practice, 2024
This study aims to investigate the validation of the Rapid Interactive Screening Test for Autism in Toddlers (RITA-T) in Turkish toddlers between 18 and 36 months of age. Children aged 18-36 months were referred to the department of child psychiatry for concerns of autism spectrum disorder, language disorder, developmental delay, and typically…
Descriptors: Foreign Countries, Turkish, Screening Tests, Autism Spectrum Disorders
Peer reviewed
Juliana Reyes-Martin; David Simó-Pinatella; Ana Andrés – Journal of Applied Research in Intellectual Disabilities, 2025
Background: Behavioural problems in individuals with intellectual disabilities have a negative impact on them. Limited assessment measures exist in Spain. This study aimed to validate the Behavior Problems Inventory--Short Form (BPI-S) in the Spanish population by examining its psychometric properties and factorial structures. Method: This study…
Descriptors: Foreign Countries, Behavior Problems, Students with Disabilities, Intellectual Disability
Peer reviewed
PDF on ERIC Download full text
David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023
We compared the influence of open-book, extended-duration versus closed-book, time-limited formats on the reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30 × free-response short answer to a question, SAQ) and end-of-year paper (4 × SAQ,…
Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format
Peer reviewed
Branscum, Ann M.; Park, Anna E.; Bihm, Elson M.; Murphy, John J.; Furlong, Michael J. – Psychology in the Schools, 2023
The Social and Emotional Health Survey-Secondary (SEHS-S) is a widely used self-report measure of adolescents' strengths and positive mental health. However, the SEHS-S does not have a proxy (i.e., parent) informant version. A parent form of the SEHS-S could allow school psychologists to gather additional information about students' strengths for…
Descriptors: Test Construction, Test Validity, Mental Health, Parent Attitudes
Peer reviewed
Anderson, Lezli Suzanne; McCallum, R. Steve; Castleman, David M.; Fast, Michelle – Psychology in the Schools, 2023
Data collected from 97 educators provide preliminary support for the psychometric integrity of an experimental self-report instrument designed to operationalize emotional intelligence (EI) specifically for educators, the Scale of Emotional Functioning: Educators, or SEF:ED. Data analyses relied in part on results from an exploratory factor…
Descriptors: Measures (Individuals), Emotional Intelligence, Test Construction, Test Validity
Peer reviewed
Stoevenbelt, Andrea H.; Wicherts, Jelte M.; Flore, Paulette C.; Phillips, Lorraine A. T.; Pietschnig, Jakob; Verschuere, Bruno; Voracek, Martin; Schwabe, Inga – Educational and Psychological Measurement, 2023
When cognitive and educational tests are administered under time limits, tests may become speeded and this may affect the reliability and validity of the resulting test scores. Prior research has shown that time limits may create or enlarge gender gaps in cognitive and academic testing. On average, women complete fewer items than men when a test…
Descriptors: Timed Tests, Gender Differences, Item Response Theory, Correlation
Peer reviewed
PDF on ERIC Download full text
Hartono, Wahyu; Hadi, Samsul; Rosnawati, Raden; Retnawati, Heri – Pegem Journal of Education and Instruction, 2023
Researchers design diagnostic assessments to measure students' knowledge structures and processing skills to provide information about their cognitive attribute. The purpose of this study is to determine the instrument's validity and score reliability, as well as to investigate the use of classical test theory to identify item characteristics. The…
Descriptors: Diagnostic Tests, Test Validity, Item Response Theory, Content Validity
Peer reviewed
Ferrari-Bridgers, Franca – International Journal of Listening, 2023
While many tools exist to assess student content knowledge, there are few that assess whether students display the critical listening skills necessary to interpret the quality of a speaker's message at the college level. The following research provides preliminary evidence for the internal consistency and factor structure of a tool, the…
Descriptors: Factor Structure, Test Validity, Community College Students, Test Reliability
Peer reviewed
Abdullah Alamer; Ahmed Al Khateeb; Abdulrahman Alshabeb – Language Assessment Quarterly, 2025
This study introduces the first Arabic Vocabulary Levels Test (Arabic-VLT), created for foreign learners of Arabic. We present compelling evidence to substantiate its validity and reliability. The Arabic-VLT was developed according to five levels, beginning with the most frequently used words (Level 1) to the least frequently used ones (Level 5),…
Descriptors: Arabic, Vocabulary Development, Test Construction, Second Language Learning
Peer reviewed
PDF on ERIC Download full text
Barno S. Abdullaeva; Diyorjon Abdullaev; Nurislom I. Khursanov; Khurshida B. Kadirova; Laylo Djuraeva – International Journal of Language Testing, 2024
Cloze tests are commonly used in language testing as a quick measure of overall language ability or reading comprehension. A problem for the analysis of cloze tests with item response theory models is that cloze test items are locally dependent. This leads to the violation of the conditional or local independence assumption of IRT models. In this…
Descriptors: Cloze Procedure, Language Tests, Test Items, Correlation
Peer reviewed
Conoyer, Sarah J.; Therrien, William J.; White, Kristen K. – Assessment for Effective Intervention, 2022
Meta-analysis was used to examine curriculum-based measurement in the content areas of social studies and science. Nineteen studies between the years of 1998 and 2020 were reviewed to determine overall mean correlation for criterion validity and examine alternate-form reliability and slope coefficients. An overall mean correlation of 0.59 was…
Descriptors: Curriculum Based Assessment, Test Validity, Test Reliability, Science Tests
Peer reviewed
Enrico Gandolfi; Richard E. Ferdig – Educational Technology Research and Development, 2025
Augmented Reality (AR) is increasingly being adopted in education to foster engagement and interest in a variety of subjects and content areas. However, there is a scarcity of instruments to measure the instructional impact of this innovation. This article addresses this gap in two unique ways. First, it presents validation results of the…
Descriptors: Simulated Environment, Measures (Individuals), Rating Scales, Item Response Theory