NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 19 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Hwanggyu Lim; Danqi Zhu; Edison M. Choe; Kyung T. Han – Journal of Educational Measurement, 2024
This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of…
Descriptors: Item Response Theory, Test Bias, Test Reliability, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Shiyu; Lin, Haiyan; Chang, Hua-Hua; Douglas, Jeff – Journal of Educational Measurement, 2016
Computerized adaptive testing (CAT) and multistage testing (MST) have become two of the most popular modes in large-scale computer-based sequential testing. Though most designs of CAT and MST exhibit strength and weakness in recent large-scale implementations, there is no simple answer to the question of which design is better because different…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Format, Sequential Approach
Peer reviewed Peer reviewed
Direct linkDirect link
Cimbricz, Sandra K.; McConn, Matthew L. – Changing English: Studies in Culture and Education, 2015
This article explores the intersection of new, large-scale standards-based testing, teacher accountability policy, and secondary curriculum and instruction in the United States. Two federally funded consortia--the Smarter Balanced Assessment Consortium and the Partnership for Readiness of College and Careers--prove focal to this paper, as these…
Descriptors: Group Testing, English Instruction, Secondary School Curriculum, Accountability
Peer reviewed Peer reviewed
Direct linkDirect link
St Clair-Thompson, Helen – Journal of Psychoeducational Assessment, 2014
The aim of the present study was to investigate the reliability and validity of a brief standardized assessment of children's working memory; "Lucid Recall." Although there are many established assessments of working memory, "Lucid Recall" is fully automated and can therefore be administered in a group setting. It is therefore…
Descriptors: Test Reliability, Test Validity, Computer Assisted Testing, Cognitive Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Yaratan, Huseyin; Suphi, Nilgun – Turkish Online Journal of Educational Technology - TOJET, 2013
Questionnaires administered manually can cause surreptitious peer pressure on the candidate to finish when 'the others" have completed theirs, forcing students to rush or skip individual items or may hinder the ability of noticing participants who may be having difficulty understanding certain items. These drawbacks can have serious…
Descriptors: Synchronous Communication, Questionnaires, Computer Assisted Testing, Undergraduate Students
Rogers, W. Todd – Canadian Journal of Education, 2014
Principals and teachers do not use large-scale assessment results because the lack of distinct and reliable subtests prevents identifying strengths and weaknesses of students and instruction, the results arrive too late to be used, and principals and teachers need assistance to use the results to improve instruction so as to improve student…
Descriptors: Foreign Countries, Group Testing, Multidimensional Scaling, Evaluation Utilization
Peer reviewed Peer reviewed
Direct linkDirect link
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Zacharis, Nick Z. – Contemporary Issues in Education Research, 2010
Although summative assessment is indispensable for determining whether or not students meet the content standards, it alone is insufficient for providing teachers and administrators with the information necessary to make ongoing decisions about instruction. This article looks at the motivational impact of the assessment on students' achievement…
Descriptors: Educational Practices, Educational Innovation, Alternative Assessment, Summative Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Macqueen, Susy; Harding, Luke – Language Testing, 2009
In 2002 the University of Cambridge Local Examinations Syndicate (UCLES) implemented a revised version of the Certificate of Proficiency in English (CPE). CPE, which is the highest level of the Main Suite of Cambridge ESOL exams, comprises five modules, "Reading," "Writing," "Use of English," "Listening" and "Speaking," the latter of which is the…
Descriptors: Speech Communication, Test Reviews, Examiners, English (Second Language)
Peer reviewed Peer reviewed
Direct linkDirect link
Horton, Simon; de Lourdes Drachler, Maria; Fuller, Alison; de Carvalho Leite, Jose Carlos – International Journal of Language & Communication Disorders, 2008
Background: In the UK clinical supervision is regarded as an essential process supporting quality improvement within the clinical governance framework, and the Royal College of Speech and Language Therapists regards it as a tool for promoting critical reflective practice. There is limited evidence of the impact on practice or improvements in…
Descriptors: Health Personnel, Supervision, Questionnaires, Factor Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Keating, Daniel P. – Early Education and Development, 2007
This article is a commentary for the special issue on the Early Development Instrument (EDI), a community tool to assess children's school readiness and developmental outcomes at a group level. The EDI is administered by kindergarten teachers, who assess their kindergarten students on 5 developmental domains: physical health and well-being, social…
Descriptors: School Readiness, Formative Evaluation, Kindergarten, Cognitive Development
Peer reviewed Peer reviewed
Krieshok, Thomas S.; Harrington, Robert G. – Journal of Counseling & Development, 1985
Reviews the administrative features, uses, development, standardization, reliability, and validity of the Multidimensional Aptitude Battery (MAB), a new group intelligence test designed to be a paper-and-pencil parallel to the Wechsler Adult Intelligence Scale-Revised (WAIS-R). (BL)
Descriptors: Group Testing, Intelligence Tests, Test Construction, Test Reliability
Peer reviewed Peer reviewed
Hubbard, J. I.; Seddon, G. M. – Research in Science and Technological Education, 1989
Investigates differences in marking standard and reliability when experienced teachers carried out assessments of the performance on practical exercises. The results showed that there was no difference between the assessments from the groups containing 5 and 20 students. (Author/YP)
Descriptors: Evaluation Research, Foreign Countries, Group Testing, Science Teachers
Peer reviewed Peer reviewed
Wodtke, Kenneth H.; And Others – Educational Evaluation and Policy Analysis, 1989
A qualitative observational study of standardized group testing in 10 kindergartens revealed variations in testing conditions, discrepancies from standardized administration procedures, and variations in children's behavior that contributed to difficulties in maintaining a uniform testing process. High-stakes group testing in kindergarten should…
Descriptors: Classroom Observation Techniques, Group Testing, Kindergarten, Primary Education
Previous Page | Next Page ยป
Pages: 1  |  2