NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Practitioners1
Location
Minnesota1
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 17 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Hongwen Guo; Matthew S. Johnson; Daniel F. McCaffrey; Lixong Gu – ETS Research Report Series, 2024
The multistage testing (MST) design has been gaining attention and popularity in educational assessments. For testing programs that have small test-taker samples, it is challenging to calibrate new items to replenish the item pool. In the current research, we used the item pools from an operational MST program to illustrate how research studies…
Descriptors: Test Items, Test Construction, Sample Size, Scaling
Peer reviewed Peer reviewed
Direct linkDirect link
Ella Anghel; Lale Khorramdel; Matthias von Davier – Large-scale Assessments in Education, 2024
As the use of process data in large-scale educational assessments is becoming more common, it is clear that data on examinees' test-taking behaviors can illuminate their performance, and can have crucial ramifications concerning assessments' validity. A thorough review of the literature in the field may inform researchers and practitioners of…
Descriptors: Educational Assessment, Test Validity, Test Items, Reaction Time
Peer reviewed Peer reviewed
Direct linkDirect link
Rosa, Claudio D.; Collado, Silvia; Larson, Lincoln R. – Journal of Environmental Education, 2022
The New Ecological Paradigm (NEP) scale adapted for use with children (NEP-C) is one of the most frequently used measures of children's environmental beliefs. Though widely utilized, the limitations of the NEP-C instrument are often overlooked. Based on a systematic synthesis of existing literature examining the NEP-C, we argue that the scale…
Descriptors: Attitude Measures, Children, Environment, Beliefs
Peer reviewed Peer reviewed
Direct linkDirect link
Tatiana Chaiban; Zeinab Nahle; Ghaith Assi; Michelle Cherfane – Discover Education, 2024
Background: Since it was first launched, ChatGPT, a Large Language Model (LLM), has been widely used across different disciplines, particularly the medical field. Objective: The main aim of this review is to thoroughly assess the performance of the distinct version of ChatGPT in subspecialty written medical proficiency exams and the factors that…
Descriptors: Medical Education, Accuracy, Artificial Intelligence, Computer Software
Peer reviewed Peer reviewed
Direct linkDirect link
Isbell, Daniel R.; Son, Young-A – Studies in Second Language Acquisition, 2022
Elicited Imitation Tests (EITs) are commonly used in second language acquisition (SLA)/bilingualism research contexts to assess the general oral proficiency of study participants. While previous studies have provided valuable EIT construct-related validity evidence, some key gaps remain. This study uses an integrative data analysis to further…
Descriptors: Bilingualism, Imitation, Language Tests, Second Language Learning
Council of Chief State School Officers, 2020
Any body of research evolves over time. Previous understandings become more nuanced, ideas are supported or rebuked, and, eventually we arrive at a clearer view of the issue. The research on score comparability across computerized devices is no exception. CCSSO [Council of Chief State School Officers] and the Center for Assessment have published…
Descriptors: Computer Assisted Testing, Scores, Intermode Differences, Influence of Technology
Peer reviewed Peer reviewed
Direct linkDirect link
Dadey, Nathan; Lyons, Susan; DePascale, Charles – Applied Measurement in Education, 2018
Evidence of comparability is generally needed whenever there are variations in the conditions of an assessment administration, including variations introduced by the administration of an assessment on multiple digital devices (e.g., tablet, laptop, desktop). This article is meant to provide a comprehensive examination of issues relevant to the…
Descriptors: Evaluation Methods, Computer Assisted Testing, Educational Technology, Technology Uses in Education
Cawthon, Stephanie; Leppo, Rachel – American Annals of the Deaf, 2013
The authors conducted a qualitative meta-analysis of the research on assessment accommodations for students who are deaf or hard of hearing. There were 16 identified studies that analyzed the impact of factors related to student performance on academic assessments across different educational settings, content areas, and types of assessment…
Descriptors: Testing Accommodations, Academic Achievement, Deafness, Hearing Impairments
Cawthon, Stephanie; Leppo, Rachel – Grantee Submission, 2013
The authors conducted a qualitative meta-analysis of the research on assessment accommodations for students who are deaf or hard of hearing. There were 16 identified studies that analyzed the impact of factors related to student performance on academic assessments across different educational settings, content areas, and types of assessment…
Descriptors: Testing Accommodations, Academic Achievement, Deafness, Hearing Impairments
Peer reviewed Peer reviewed
Direct linkDirect link
Forbey, Johnathan D.; Ben-Porath, Yossef S. – Psychological Assessment, 2007
Computerized adaptive testing in personality assessment can improve efficiency by significantly reducing the number of items administered to answer an assessment question. Two approaches have been explored for adaptive testing in computerized personality assessment: item response theory and the countdown method. In this article, the authors…
Descriptors: Personality Traits, Computer Assisted Testing, Test Validity, Personality Assessment
Peer reviewed Peer reviewed
Meijer, Rob R. – Applied Psychological Measurement, 1995
A statistic used by R. Meijer (1994) to determine person-fit referred to the number of errors from the deterministic Guttman model (L. Guttman, 1950), but this was, in fact, based on the number of errors from the deterministic Guttman model as defined by J. Loevinger (1947, 1948). (SLD)
Descriptors: Difficulty Level, Models, Responses, Scaling
Peer reviewed Peer reviewed
Sijtsma, Klaas – Applied Psychological Measurement, 1998
Reviews developments in nonparametric item-response theory (NIRT), from its historic origins in item-response theory (IRT) and scale analysis to new theoretical results for practical test construction. Discusses theoretical results from NIRT often relevant to IRT. Contains 134 references. (SLD)
Descriptors: Item Response Theory, Nonparametric Statistics, Research Methodology, Scores
Whitney, Douglas R.; And Others – 1985
This research brief summarizes the available reliability and validity data available in, but spread throughout, a number of General Educational Development (GED) Testing Service publications. A section on reliability discusses how to determine reliability of a test's scores and two ways of assessing the reliability of a test--internal consistency…
Descriptors: Adult Education, High School Equivalency Programs, Item Analysis, Scores
Peer reviewed Peer reviewed
Aiken, Lewis R. – Journal of Research and Development in Education, 1987
A critical review is presented of research conducted during the past 20 years on multiple-choice tests of achievement and aptitude. The design and use of multiple-choice tests is emphasized, but information concerning the socioeducational implications of relying on such tests is also included. (Author/CB)
Descriptors: Academic Achievement, Academic Aptitude, Educational Sociology, Multiple Choice Tests
Peer reviewed Peer reviewed
Benson, Jeri – Educational and Psychological Measurement, 1981
A review of the research on item writing, item format, test instructions, and item readability indicated the importance of instrument structure in the interpretation of test data. The effect of failing to consider these areas on the content validity of achievement test scores is discussed. (Author/GK)
Descriptors: Achievement Tests, Elementary Secondary Education, Literature Reviews, Scores
Previous Page | Next Page ยป
Pages: 1  |  2