NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Researchers1
Laws, Policies, & Programs
No Child Left Behind Act 20011
What Works Clearinghouse Rating
Showing 1 to 15 of 45 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Pornphan Sureeyatanapas; Panitas Sureeyatanapas; Uthumporn Panitanarak; Jittima Kraisriwattana; Patchanan Sarootyanapat; Daniel O'Connell – Language Testing in Asia, 2024
Ensuring consistent and reliable scoring is paramount in education, especially in performance-based assessments. This study delves into the critical issue of marking consistency, focusing on speaking proficiency tests in English language learning, which often face greater reliability challenges. While existing literature has explored various…
Descriptors: Foreign Countries, Students, English Language Learners, Speech
Peer reviewed Peer reviewed
Direct linkDirect link
Carter, Jane – British Educational Research Journal, 2020
The Phonics Screening Check (PSC) was introduced in England in 2012 for Year 1 children (aged 5 and 6). There have been criticisms of the check in relation to its reliability and appropriateness as an assessment for early reading, although advocates of the check see it as a valuable tool in securing progress in early reading. This mixed methods…
Descriptors: Phonics, Teacher Attitudes, Socioeconomic Status, Testing Problems
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Allehaiby, Wid Hasen; Al-Bahlani, Sara – Arab World English Journal, 2021
One of the main challenges higher educational institutions encounter amid the recent COVID-19 crisis is transferring assessment approaches from the traditional face-to-face form to the online Emergency Remote Teaching approach. A set of language assessment principles, practicality, reliability, validity, authenticity, and washback, which can be…
Descriptors: Barriers, Distance Education, Evaluation Methods, Teaching Methods
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Phongsirikul, Marissa – rEFLections, 2018
The study aimed to investigate teachers' and students' perceptions towards traditional and alternative types of assessment within a classroom context of an English course provided for English-majoring students at tertiary level. A combination of traditional and alternative assessment tools was implemented in the study. The researcher developed…
Descriptors: Teacher Attitudes, Student Attitudes, Alternative Assessment, Second Language Learning
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Polat, Murat – International Journal of Psychology and Educational Studies, 2020
The application of high-stakes tests to choose students for higher education in Turkey has been considered as a reliable and effective way of assessment for so long. However, the application of a multiple-choice test in testing various skills could bring a number of side-effects with itself. This study aimed to investigate the backwash effect of…
Descriptors: Testing Problems, College Students, Student Attitudes, College Entrance Examinations
Davis, Michelle R. – Education Week, 2013
Widespread technical failures and interruptions of recent online testing in a number of states have shaken the confidence of educators and policymakers in high-tech assessment methods and raised serious concerns about schools' technological readiness for the coming common-core online tests. The glitches arose as many districts in the 46 states…
Descriptors: Computer Assisted Testing, Testing Problems, Reliability, Public Schools
Peer reviewed Peer reviewed
Direct linkDirect link
Davis, Andrew – Ethics and Education, 2015
PISA claims that it can extend its reach from its current core subjects of Reading, Science, Maths and problem-solving. Yet given the requirement for high levels of reliability for PISA, especially in the light of its current high stakes character, proposed widening of its subject coverage cannot embrace some important aspects of the social and…
Descriptors: International Assessment, High Stakes Tests, Reliability, Academic Achievement
Feinberg, Richard A. – ProQuest LLC, 2012
Subscores, also known as domain scores, diagnostic scores, or trait scores, can help determine test-takers' relative strengths and weaknesses and appropriately focus remediation. However, subscores often have poor psychometric properties, particularly reliability and distinctiveness (Folske, Gessaroli, & Swanson, 1999; Monaghan, 2006;…
Descriptors: Simulation, Tests, Testing, Scores
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Ghilay, Yaron; Ghilay, Ruth – Journal of Educational Technology, 2012
The study examined advantages and disadvantages of computerised assessment compared to traditional evaluation. It was based on two samples of college students (n=54) being examined in computerised tests instead of paper-based exams. Students were asked to answer a questionnaire focused on test effectiveness, experience, flexibility and integrity.…
Descriptors: Student Evaluation, Higher Education, Comparative Analysis, Computer Assisted Testing
Weiss, David J. – 1969
Today's psychological measurement depends almost exclusively on the "standardized test." A certain amount of non-standardization, however, exists in the administration of any standardized test, with the amount unknown for any given test score. Time limits on tests pose a bigger problem since another variable is introduced, pressure. Test taking…
Descriptors: Computer Oriented Programs, Individual Testing, Measurement Instruments, Motivation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Tillinghast, B. S., Jr.; Renzulli, Joseph S. – Journal of Educational Research, 1968
The purpose of this study was to further examine the reliability of the Peabody Picture Vocabulary Test (PPVT), a new instrument to measure hearing vocabulary so that a student's verbal intelligence may be inferred. A group testing procedure was utilized by reproducing the PPVT plates on 35 millimeter transparent slides and projecting them onto a…
Descriptors: Aptitude Tests, Elementary School Students, Evaluation, Group Testing
Smith, Leon I.; Greenberg, Sandra – 1973
A discussion of selected applications of new tests developed within the context of a large-scale curriculum for educable mentally retarded (EMR) children, the Social Learning Curriculum (SLC), is presented in this paper which investigates three types of reliability that need to be demonstrated in order to provide a basis of these applications. The…
Descriptors: Curriculum Evaluation, Educational Research, Evaluation Methods, Measurement Techniques
Peer reviewed Peer reviewed
Kaiser, Henry F. – Educational and Psychological Measurement, 1980
The use of Bayes' estimates for proportions in the Law of Comparative Judgment is suggested to avoid sample proportions of zero and one. (Author)
Descriptors: Bayesian Statistics, Comparative Analysis, Reliability, Statistical Analysis
Helms, LuAnn Sherbeck – 1999
This paper discusses the fact that reliability is about scores and not tests and how reliability limits effect sizes. The paper also explores the classical reliability coefficients of stability, equivalence, and internal consistency. Stability is concerned with how stable test scores will be over time, while equivalence addresses the relationship…
Descriptors: Effect Size, Meta Analysis, Reliability, Scores
McVey, P. J. – Assessment in Higher Education, 1976
The results of 16 pairs of "equivalent papers" were used to estimate the reliability of the papers and the extent to which each paper correlated with the year's average test grade. Estimates were also made of the work of the grade for each paper as a predictor of true subject grades. It is shown that a "profile" of grades would mislead.…
Descriptors: Grades (Scholastic), Higher Education, Profiles, Reliability
Previous Page | Next Page ยป
Pages: 1  |  2  |  3