NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 1 to 15 of 215 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Jones, Paul; Tong, Ye; Liu, Jinghua; Borglum, Joshua; Primoli, Vince – Journal of Educational Measurement, 2022
This article studied two methods to detect mode effects in two credentialing exams. In Study 1, we used a "modal scale comparison approach," where the same pool of items was calibrated separately, without transformation, within two TC cohorts (TC1 and TC2) and one OP cohort (OP1) matched on their pool-based scale score distributions. The…
Descriptors: Scores, Credentials, Licensing Examinations (Professions), Computer Assisted Testing
NWEA, 2018
Some schools use results from the MAP® Growth™ interim assessments from the Northwest Evaluation Association (NWEA®) in a number of high-stakes ways. These include as a component of their teacher evaluation systems; to determine whether a student advances to the next grade; or as an indicator for student readiness for certain programs or…
Descriptors: High Stakes Tests, Guidelines, School Districts, Intervention
Peer reviewed Peer reviewed
Direct linkDirect link
Simper, Natalie; Frank, Brian; Kaupp, Jake; Mulligan, Nerissa; Scott, Jill – Assessment & Evaluation in Higher Education, 2019
Critical thinking, problem solving and communication are fundamental elements of undergraduate education, but methods for assessing these skills across an institution are susceptible to logistical, motivational and financial issues. Queen's University conducted two research studies investigating the use of standardised tests to assess cognitive…
Descriptors: Foreign Countries, Student Evaluation, Standardized Tests, Undergraduate Students
Peer reviewed Peer reviewed
Direct linkDirect link
Jerrim, John; Micklewright, John; Heine, Jorg-Henrik; Salzer, Christine; McKeown, Caroline – Oxford Review of Education, 2018
The Programme for International Student Assessment (PISA) is an important cross-national study of 15-year-olds' academic knowledge and skills. Educationalists and public policymakers eagerly await the tri-annual results, with particular interest in whether their country has moved up or slid down the international rankings, as compared to earlier…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Peer reviewed Peer reviewed
Direct linkDirect link
Jerrim, John – Assessment in Education: Principles, Policy & Practice, 2016
The Programme for International Assessment (PISA) is an important cross-national study of 15-year olds academic achievement. Although it has traditionally been conducted using paper-and-pencil tests, the vast majority of countries will use computer-based assessment from 2015. In this paper, we consider how cross-country comparisons of children's…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Peer reviewed Peer reviewed
Direct linkDirect link
Baker, Eva L. – Educational Researcher, 2016
This article investigates the persistent and change elements of educational testing and assessment from 1920 to the present day. I show by examining the addresses and texts of American Educational Research Association presidents a continuing focus on schools, from early experiments and development up through applications in accountability systems.…
Descriptors: Research, Educational Testing, Presidents, Professional Associations
Peer reviewed Peer reviewed
Direct linkDirect link
Frein, Scott T. – Teaching of Psychology, 2011
This article describes three experiments comparing paper-and-pencil tests (PPTs) to computer-based tests (CBTs) in terms of test method preferences and student performance. In Experiment 1, students took tests using three methods: PPT in class, CBT in class, and CBT at the time and place of their choosing. Results indicate that test method did not…
Descriptors: College Students, Psychology, Introductory Courses, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Jaeger, Martin; Adair, Desmond – European Journal of Engineering Education, 2017
Online quizzes have been shown to be effective learning and assessment approaches. However, if scenario-based online construction safety quizzes do not include time pressure similar to real-world situations, they reflect situations too ideally. The purpose of this paper is to compare engineering students' performance when carrying out an online…
Descriptors: Engineering Education, Quasiexperimental Design, Tests, Academic Achievement
Peer reviewed Peer reviewed
Direct linkDirect link
Freund, Philipp Alexander; Holling, Heinz – Intelligence, 2011
The interpretation of retest scores is problematic because they are potentially affected by measurement and predictive bias, which impact construct validity, and because their size differs as a function of various factors. This paper investigates the construct stability of scores on a figural matrices test and models retest effects at the level of…
Descriptors: Intelligence, Test Results, Individual Testing, Construct Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Delen, Erhan – EURASIA Journal of Mathematics, Science & Technology Education, 2015
As technology has become more advanced and accessible in instructional settings, there has been an upward trend in computer-based testing in the last decades. The present experimental study examines students' behaviors during computer-based testing in two different conditions and explores how these conditions affect the test results. Results…
Descriptors: Foreign Countries, Computer Assisted Testing, Student Behavior, Test Results
Peer reviewed Peer reviewed
Direct linkDirect link
Rusticus, Shayna A.; Lovato, Chris Y. – Practical Assessment, Research & Evaluation, 2011
Assessing the comparability of different groups is an issue facing many researchers and evaluators in a variety of settings. Commonly, null hypothesis significance testing (NHST) is incorrectly used to demonstrate comparability when a non-significant result is found. This is problematic because a failure to find a difference between groups is not…
Descriptors: Medical Education, Evaluators, Intervals, Testing
Snyder, James – ProQuest LLC, 2010
This dissertation research examined the changes in item RIT calibration that occurred when adding audio to a set of currently calibrated RIT items and then placing these new items as field test items in the modified assessments on the NWEA MAP test platform. The researcher used test results from over 600 students in the Poway School District in…
Descriptors: Test Results, Test Items, Field Tests, Data Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Lissitz, Robert W.; Hou, Xiaodong; Slater, Sharon Cadman – Journal of Applied Testing Technology, 2012
This article investigates several questions regarding the impact of different item formats on measurement characteristics. Constructed response (CR) items and multiple choice (MC) items obviously differ in their formats and in the resources needed to score them. As such, they have been the subject of considerable discussion regarding the impact of…
Descriptors: Computer Assisted Testing, Scoring, Evaluation Problems, Psychometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Steedle, Jeffrey; Kugelmass, Heather; Nemeth, Alex – Change: The Magazine of Higher Learning, 2010
Many postsecondary institutions currently administer standardized tests of general college outcomes; more than a quarter of Association of American Colleges and Universities (AAC&U) member institutions do so. Using standardized tests for accountability purposes has been contentious mainly because these tests do not measure every important…
Descriptors: Test Results, Standardized Tests, Test Validity, Educational Testing
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Neary-Sundquist, Colleen – Studies in Second Language Learning and Teaching, 2014
This study investigates the use of pragmatic markers (PMs) by learners of English at varying proficiency levels. The study analyzes data from a university-level oral proficiency exam that categorized Chinese and Korean English-as-a-second-language (ESL) speakers into four proficiency levels and compares data with those of native speakers taking…
Descriptors: Pragmatics, Second Language Learning, English (Second Language), Language Proficiency
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  15