NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 241 to 255 of 47,279 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Xiong, Yao; Schunn, Christian D.; Wu, Yong – Journal of Computer Assisted Learning, 2023
Background: For peer assessment, reliability (i.e., consistency in ratings across peers) and validity (i.e., consistency of peer ratings with instructors or experts) are frequently examined in the research literature to address a central concern of instructors and students. Although the average levels are generally promising, both reliability and…
Descriptors: Peer Evaluation, Computer Assisted Testing, Test Reliability, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Nobuyuki Hanaki; Jan R. Magnus; Donghoon Yoo – Journal of Statistics and Data Science Education, 2023
Common sense is a dynamic concept and it is natural that our (statistical) common sense lags behind the development of statistical science. What is not so easy to understand is why common sense lags behind as much as it does. We conduct a survey among Japanese students and provide examples and tentative explanations of a number of statistical…
Descriptors: Statistics, Statistics Education, Epistemology, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Zinovy Radovilsky; Vishwanath Hegde – Curriculum and Teaching, 2023
The issues of academic integrity across online and in-person assessments were addressed by analyzing student total, conceptual, and numerical performance scores in the three modes of assessment: (1) In-person assessment with proctoring; (2) Online unproctored assessment; and (3) Respondus assessment online with proctoring. It was identified that…
Descriptors: Academic Achievement, Educational Technology, Computer Assisted Testing, Evaluation
New York State Education Department, 2020
The Regulations of the Commissioner of Education provide that an elementary-level science test is to be administered in Grade 4 to serve as a basis for determining students' needs for academic intervention services in science. The New York State Grade 4 Elementary-Level Science Test consists of two required components: a Written Test and a…
Descriptors: Grade 4, Science Tests, Testing Programs, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Wyse, Adam E. – Educational and Psychological Measurement, 2021
An essential question when computing test--retest and alternate forms reliability coefficients is how many days there should be between tests. This article uses data from reading and math computerized adaptive tests to explore how the number of days between tests impacts alternate forms reliability coefficients. Results suggest that the highest…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Reliability, Reading Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip – Educational and Psychological Measurement, 2022
Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores and hence to incomplete data on mastery tests such as the AP and U.S. Medical Licensing examinations. Investigators are often interested in estimating the probabilities of passing of the examinees with incomplete data on mastery tests.…
Descriptors: Mastery Tests, Computer Assisted Testing, Probability, Test Wiseness
Peer reviewed Peer reviewed
Direct linkDirect link
LaFlair, Geoffrey T.; Langenfeld, Thomas; Baig, Basim; Horie, André Kenji; Attali, Yigal; von Davier, Alina A. – Journal of Computer Assisted Learning, 2022
Background: Digital-first assessments leverage the affordances of technology in all elements of the assessment process--from design and development to score reporting and evaluation to create test taker-centric assessments. Objectives: The goal of this paper is to describe the engineering, machine learning, and psychometric processes and…
Descriptors: Computer Assisted Testing, Affordances, Scoring, Engineering
Peer reviewed Peer reviewed
Direct linkDirect link
Corcoran, Stephanie – Contemporary School Psychology, 2022
With the iPad-mediated cognitive assessment gaining popularity with school districts and the need for alternative modes for training and instruction during this COVID-19 pandemic, school psychology training programs will need to adapt to effectively train their students to be competent in administering, scoring, an interpreting cognitive…
Descriptors: School Psychologists, Professional Education, Job Skills, Cognitive Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Turner, Megan I.; Van Norman, Ethan R.; Hojnoski, Robin L. – Journal of Psychoeducational Assessment, 2022
Star Math (SM) is a popular computer adaptive test (CAT) schools use to screen students for academic risk. Despite its popularity, few independent investigations of its diagnostic accuracy have been conducted. We evaluated the diagnostic accuracy of SM based upon vendor provided cut-scores (25th and 40th percentiles nationally) in predicting…
Descriptors: Accuracy, Adaptive Testing, Computer Assisted Testing, High Stakes Tests
Stefan Lorenz – ProQuest LLC, 2024
This dissertation develops and applies sophisticated Item Response Theory (IRT) methods to address fundamental measurement challenges in cognitive testing, focusing on the Armed Services Vocational Aptitude Battery (ASVAB) data from the National Longitudinal Survey of Youth (NLSY). The first chapter implements a confirmatory multidimensional IRT…
Descriptors: Human Capital, Item Response Theory, Vocational Aptitude, Armed Forces
Peer reviewed Peer reviewed
Direct linkDirect link
Olsho, Alexis; Smith, Trevor I.; Eaton, Philip; Zimmerman, Charlotte; Boudreaux, Andrew; White Brahmia, Suzanne – Physical Review Physics Education Research, 2023
We developed the Physics Inventory of Quantitative Literacy (PIQL) to assess students' quantitative reasoning in introductory physics contexts. The PIQL includes several "multiple-choice-multipleresponse" (MCMR) items (i.e., multiple-choice questions for which more than one response may be selected) as well as traditional single-response…
Descriptors: Multiple Choice Tests, Science Tests, Physics, Measures (Individuals)
Peer reviewed Peer reviewed
Direct linkDirect link
Guzman-Orth, Danielle; Steinberg, Jonathan; Albee, Traci – Language Testing, 2023
Standardizing accessible test design and development to meet students' individual access needs is a complex task. The following study provides one approach to accessible test design and development using participatory design methods with school community members. Participatory research provides opportunities to empower collaborators by co-creating…
Descriptors: English Language Learners, Blindness, Visual Impairments, Testing Accommodations
Peer reviewed Peer reviewed
Direct linkDirect link
Van Norman, Ethan R.; Forcht, Emily R. – Assessment for Effective Intervention, 2023
This study explored the validity of growth on two computer adaptive tests, Star Reading and Star Math, in explaining performance on an end-of-year achievement test for a sample of students in Grades 3 through 6. Results from quantile regression analyses indicate that growth on Star Reading explained a statistically significant amount of variance…
Descriptors: Test Validity, Computer Assisted Testing, Adaptive Testing, Grade Prediction
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Qaisar Khan; Sadia Ashraf – Bulletin of Education and Research, 2023
Assessment methods have more effects on the strategy of study; if an exam requires the recall of factual information, then students adopt the surface-level approach or rote learning (Newble & Jaeger, 1983). Measuring the learning outcomes of students is paramount for learning and teaching improvement. However, in the Pakistani education…
Descriptors: Testing Programs, Standardized Tests, High Stakes Tests, Rote Learning
Peer reviewed Peer reviewed
Direct linkDirect link
Gregory Chernov – Evaluation Review, 2025
Most existing solutions to the current replication crisis in science address only the factors stemming from specific poor research practices. We introduce a novel mechanism that leverages the experts' predictive abilities to analyze the root causes of replication failures. It is backed by the principle that the most accurate predictor is the most…
Descriptors: Replication (Evaluation), Prediction, Scientific Research, Failure
Pages: 1  |  ...  |  13  |  14  |  15  |  16  |  17  |  18  |  19  |  20  |  21  |  ...  |  3152