Showing all 14 results
Peer reviewed
Jones, Paul; Tong, Ye; Liu, Jinghua; Borglum, Joshua; Primoli, Vince – Journal of Educational Measurement, 2022
This article studied two methods to detect mode effects in two credentialing exams. In Study 1, we used a "modal scale comparison approach," where the same pool of items was calibrated separately, without transformation, within two TC cohorts (TC1 and TC2) and one OP cohort (OP1) matched on their pool-based scale score distributions. The…
Descriptors: Scores, Credentials, Licensing Examinations (Professions), Computer Assisted Testing
Peer reviewed
Madya, Suwarsih; Retnawati, Heri; Purnawan, Ari; Putro, Nur Hidayanto Pancoro Setyo; Apino, Ezi – TEFLIN Journal: A publication on the teaching and learning of English, 2019
This explorative-descriptive study set out to examine the equivalence among Test of English Proficiency (TOEP) forms, developed by the Indonesian Testing Service Centre (ITSC) and co-founded by The Association for The Teaching of English as a Foreign Language in Indonesia (TEFLIN) and The Association of Psychology in Indonesia. Using a…
Descriptors: Language Tests, Language Proficiency, English (Second Language), Second Language Learning
Peer reviewed
Jerrim, John; Micklewright, John; Heine, Jorg-Henrik; Salzer, Christine; McKeown, Caroline – Oxford Review of Education, 2018
The Programme for International Student Assessment (PISA) is an important cross-national study of 15-year-olds' academic knowledge and skills. Educationalists and public policymakers eagerly await the tri-annual results, with particular interest in whether their country has moved up or slid down the international rankings, as compared to earlier…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Peer reviewed
Jaeger, Martin; Adair, Desmond – European Journal of Engineering Education, 2017
Online quizzes have been shown to be effective learning and assessment approaches. However, if scenario-based online construction safety quizzes do not include time pressure similar to real-world situations, they reflect situations too ideally. The purpose of this paper is to compare engineering students' performance when carrying out an online…
Descriptors: Engineering Education, Quasiexperimental Design, Tests, Academic Achievement
Peer reviewed
Han, Jing; Bao, Lei; Chen, Li; Cai, Tianfang; Pi, Yuan; Zhou, Shaona; Tu, Yan; Koenig, Kathleen – Physical Review Special Topics - Physics Education Research, 2015
The Force Concept Inventory (FCI) is a 30-question multiple-choice assessment that has been a building block for much of the physics education research done today. In practice, there are often concerns regarding the length of the test and possible test-retest effects. Since many studies in the literature use the mean score of the FCI as the…
Descriptors: Physics, Multiple Choice Tests, Science Instruction, Scores
Peer reviewed
Lissitz, Robert W.; Hou, Xiaodong; Slater, Sharon Cadman – Journal of Applied Testing Technology, 2012
This article investigates several questions regarding the impact of different item formats on measurement characteristics. Constructed response (CR) items and multiple choice (MC) items obviously differ in their formats and in the resources needed to score them. As such, they have been the subject of considerable discussion regarding the impact of…
Descriptors: Computer Assisted Testing, Scoring, Evaluation Problems, Psychometrics
Peer reviewed
Pyle, Katie; Jones, Emily; Williams, Chris; Morrison, Jo – Educational Research, 2009
Background: All national curriculum tests in England are pre-tested as part of the development process. Differences in pupil performance between pre-test and live test are consistently found. This difference has been termed the pre-test effect. Understanding the pre-test effect is essential in the test development and selection processes and in…
Descriptors: Foreign Countries, Pretesting, Context Effect, National Curriculum
Christensen, Laurene L. – ProQuest LLC, 2010
This study investigated the inclusion of English language learners (ELLs) in state standards and assessments, as measured by comments made by peer reviewers in the federal evaluation of states' standards and assessments. As required by the Elementary and Secondary Education Act (ESEA), reauthorized in 2001 as No Child Left Behind (NCLB), states…
Descriptors: Elementary Secondary Education, Federal Legislation, Research Methodology, State Standards
Stocking, Martha L. – 1988
The construction of parallel editions of conventional tests for purposes of test security while maintaining score comparability has always been a recognized and difficult problem in psychometrics and test construction. The introduction of new modes of test construction, e.g., adaptive testing, changes the nature of the problem, but does not make…
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Identification
Peer reviewed
Straetmans, Gerard J. J. M.; Eggen, Theo J. H. M. – Educational Research and Evaluation (An International Journal on Theory and Practice), 1998
Three test administration procedures for making placement decisions in adult education were compared (paper-based, computer-based, and computerized-adaptive tests) with 90 adult-education students. Test performance was not differentially affected by the mode of administration, but the computerized adaptive test always yielded more precise ability…
Descriptors: Ability, Adaptive Testing, Adult Education, Adult Students
Peer reviewed
Klein, Stephen P.; And Others – Educational Evaluation and Policy Analysis, 1997
Whether differences in mean scores among gender and racial/ethnic groups on science performance assessments are comparable to those for traditional tests was studied with 2,000 students in grades five, six, and nine. Overall, results suggest that the type of test has little effect on these differences in scores. (SLD)
Descriptors: Comparative Analysis, Cultural Differences, Ethnic Groups, Performance Based Assessment
Schaeffer, Gary A.; And Others – 1995
This report summarizes the results from two studies. The first assessed the comparability of scores derived from linear computer-based (CBT) and computer adaptive (CAT) versions of the three Graduate Record Examinations (GRE) General Test measures. A verbal CAT was taken by 1,507, a quantitative CAT by 1,354, and an analytical CAT by 995…
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Equated Scores
Bay, Luz – 1998
A study was conducted to investigate the difference in student performance on multiple choice (MC) and constructed response (CR) items relative to the achievement levels of the National Assessment of Educational Progress (NAEP). The study included an investigation of how estimates of student performance were affected by item response theory (IRT)…
Descriptors: Academic Achievement, Comparative Analysis, Constructed Response, Cutting Scores
York Region Board of Education, Aurora (Ontario). – 1986
To determine whether students enrolled in one Ontario region's early French immersion (FI) programs developed English reading skills comparable to their non-FI peers, a monitoring process was begun in the first FI program year (grade 3) in which formal English instruction is given. The FI cohort and a control group matched for mental abilities and…
Descriptors: Comparative Analysis, Elementary Education, English, Foreign Countries