Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 13 |
Since 2006 (last 20 years) | 45 |
Descriptor
Scoring | 91 |
Statistical Analysis | 91 |
Computer Assisted Testing | 35 |
Testing | 26 |
Scores | 22 |
Test Construction | 21 |
Correlation | 19 |
Foreign Countries | 17 |
Test Items | 16 |
Test Reliability | 15 |
Comparative Analysis | 14 |
Author
Attali, Yigal | 2 |
Davey, Tim | 2 |
Echternacht, Gary | 2 |
Kim, Sooyeon | 2 |
Livingston, Samuel A. | 2 |
Puhan, Gautam | 2 |
Ramineni, Chaitanya | 2 |
Williamson, David M. | 2 |
Abe, Mariko | 1 |
Adams, Deanne M. | 1 |
Ali, Usama S. | 1 |
Education Level
Higher Education | 17 |
Postsecondary Education | 13 |
Elementary Secondary Education | 5 |
Secondary Education | 5 |
Elementary Education | 4 |
Middle Schools | 3 |
Grade 8 | 2 |
High Schools | 2 |
Junior High Schools | 2 |
Grade 7 | 1 |
Audience
Researchers | 3 |
Practitioners | 2 |
Teachers | 2 |
Parents | 1 |
Location
Japan | 4 |
Australia | 2 |
California | 2 |
Brazil | 1 |
China | 1 |
Denmark | 1 |
Illinois | 1 |
Israel | 1 |
Malaysia | 1 |
Maryland | 1 |
Michigan | 1 |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022
As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…
Descriptors: Scores, Scoring, Comparative Analysis, Testing
Buzick, Heather; Oliveri, Maria Elena; Attali, Yigal; Flor, Michael – Applied Measurement in Education, 2016
Automated essay scoring is a developing technology that can provide efficient scoring of large numbers of written responses. Its use in higher education admissions testing provides an opportunity to collect validity and fairness evidence to support current uses and inform its emergence in other areas such as K-12 large-scale assessment. In this…
Descriptors: Essays, Learning Disabilities, Attention Deficit Hyperactivity Disorder, Scoring
Cohen, Yoav; Levi, Effi; Ben-Simon, Anat – Applied Measurement in Education, 2018
In the current study, two pools of 250 essays, all written as a response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation to the essay's true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By…
Descriptors: Test Validity, Automation, Scoring, Computer Assisted Testing
Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2017
The purpose of this simulation study was to assess the accuracy of a classical test theory (CTT)-based procedure for estimating the alternate-forms reliability of scores on a multistage test (MST) having 3 stages. We generated item difficulty and discrimination parameters for 10 parallel, nonoverlapping forms of the complete 3-stage test and…
Descriptors: Accuracy, Test Theory, Test Reliability, Adaptive Testing
Zimmerman, Whitney Alicia; Kang, Hyun Bin; Kim, Kyung; Gao, Mengzhao; Johnson, Glenn; Clariana, Roy; Zhang, Fan – Journal of Statistics Education, 2018
Over two semesters short essay prompts were developed for use with the Graphical Interface for Knowledge Structure (GIKS), an automated essay scoring system. Participants were students in an undergraduate-level online introductory statistics course. The GIKS compares students' writing samples with an expert's to produce keyword occurrence and…
Descriptors: Undergraduate Students, Introductory Courses, Statistics, Computer Assisted Testing
Ali, Usama S.; Chang, Hua-Hua – ETS Research Report Series, 2014
Adaptive testing is advantageous in that it provides more efficient ability estimates with fewer items than linear testing does. Item-driven adaptive pretesting may also offer similar advantages, and verification of such a hypothesis about item calibration was the main objective of this study. A suitability index (SI) was introduced to adaptively…
Descriptors: Adaptive Testing, Simulation, Pretests Posttests, Test Items
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2015
An equating procedure for a testing program with evolving distribution of examinee profiles is developed. No anchor is available because the original scoring scheme was based on expert judgment of the item difficulties. Pairs of examinees from two administrations are formed by matching on coarsened propensity scores derived from a set of…
Descriptors: Equated Scores, Testing Programs, College Entrance Examinations, Scoring
Gehsmann, Kristin; Spichtig, Alexandra; Tousley, Elias – Literacy Research: Theory, Method, and Practice, 2017
Assessments of developmental spelling, also called spelling inventories, are commonly used to understand students' orthographic knowledge (i.e., knowledge of how written words work) and to determine their stages of spelling and reading development. The information generated by these assessments is used to inform teachers' grouping practices and…
Descriptors: Spelling, Computer Assisted Testing, Grouping (Instructional Purposes), Teaching Methods
Vázquez-Alonso, Ángel; Manassero-Mas, María-Antonia; García-Carmona, Antonio; Montesano de Talavera, Marisa – Asia-Pacific Forum on Science Learning and Teaching, 2016
This study applies a new quantitative methodological approach to diagnose epistemology conceptions in a large sample. The analyses use seven multiple-rating items on the epistemology of science drawn from the item pool Views on Science-Technology-Society (VOSTS). The bases of the new methodological diagnostic approach are the empirical…
Descriptors: Epistemology, Statistical Analysis, Science and Society, Scientific Principles
Oliveri, María Elena; von Davier, Alina A. – International Journal of Testing, 2016
In this study, we propose that the unique needs and characteristics of linguistic minorities should be considered throughout the test development process. Unlike most measurement invariance investigations in the assessment of linguistic minorities, which typically are conducted after test administration, we propose strategies that focus on the…
Descriptors: Psychometrics, Linguistics, Test Construction, Testing
Bainter, Sierra A.; Curran, Patrick J. – Journal of Cognition and Development, 2015
Amid recent progress in cognitive development research, high-quality data resources are accumulating, and data sharing and secondary data analysis are becoming increasingly valuable tools. Integrative data analysis (IDA) is an exciting analytical framework that can enhance secondary data analysis in powerful ways. IDA pools item-level data across…
Descriptors: Data Analysis, Integrated Activities, Inferences, Statistical Analysis
McDonald, Christin A.; Volker, Martin A.; Lopata, Christopher; Toomey, Jennifer A.; Thomeer, Marcus L.; Lee, Gloria K.; Lipinski, Alanna M.; Dua, Elissa H.; Schiavo, Audrey M.; Bain, Fabienne; Nelson, Andrew T. – Journal of Psychoeducational Assessment, 2014
The visual-motor skills of 90 youth with high-functioning autism spectrum disorders (HFASDs) and 51 typically developing (TD) youth were assessed using the Beery-Buktenica Developmental Test of Visual-Motor Integration, Sixth Edition (VMI-VI) and Koppitz Developmental Scoring System for the Bender-Gestalt Test-Second Edition (KOPPITZ-2).…
Descriptors: Perceptual Motor Coordination, Autism, Pervasive Developmental Disorders, Comparative Analysis
Mrazik, Martin; Janzen, Troy M.; Dombrowski, Stefan C.; Barford, Sean W.; Krawchuk, Lindsey L. – Canadian Journal of School Psychology, 2012
A total of 19 graduate students enrolled in a graduate course conducted 6 consecutive administrations of the Wechsler Intelligence Scale for Children, 4th edition (WISC-IV, Canadian version). Test protocols were examined to obtain data describing the frequency of examiner errors, including administration and scoring errors. Results identified 511…
Descriptors: Intelligence Tests, Intelligence, Statistical Analysis, Scoring
Keller-Margulis, Milena A.; Mercer, Sterett H.; Payan, Anita; McGee, Wendy – School Psychology Quarterly, 2015
The purpose of this study was to examine annual growth patterns and gender differences in written expression curriculum-based measurement (WE-CBM) when used in the context of universal screening. Students in second through fifth grade (n = 672) from 2 elementary schools that used WE-CBM as a universal screener participated in the study. Student…
Descriptors: Gender Differences, Curriculum Based Assessment, Elementary School Students, Writing Skills
Mao, Liyang; Liu, Ou Lydia; Roohr, Katrina; Belur, Vinetha; Mulholland, Matthew; Lee, Hee-Sun; Pallant, Amy – Educational Assessment, 2018
Scientific argumentation is one of the core practices for teachers to implement in science classrooms. We developed a computer-based formative assessment to support students' construction and revision of scientific arguments. The assessment is built upon automated scoring of students' arguments and provides feedback to students and teachers.…
Descriptors: Computer Assisted Testing, Science Tests, Scoring, Automation