NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
No Child Left Behind Act 20011
What Works Clearinghouse Rating
Showing 1 to 15 of 50 results Save | Export
Benton, Tom; Williamson, Joanna – Research Matters, 2022
Equating methods are designed to adjust between alternate versions of assessments targeting the same content at the same level, with the aim that scores from the different versions can be used interchangeably. The statistical processes used in equating have, however, been extended to statistically "link" assessments that differ, such as…
Descriptors: Statistical Analysis, Equated Scores, Definitions, Alternative Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip; Wan, Ping; Choi, Seung W.; Kim, Dong-In – Journal of Educational Measurement, 2015
With an increase in the number of online tests, the number of interruptions during testing due to unexpected technical issues seems to be on the rise. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. Researchers such as…
Descriptors: Computer Assisted Testing, Testing Problems, Scores, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip; Wan, Ping; Whitaker, Mike; Kim, Dong-In; Zhang, Litong; Choi, Seung W. – Journal of Educational Measurement, 2014
With an increase in the number of online tests, interruptions during testing due to unexpected technical issues seem unavoidable. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. There is a lack of research on this…
Descriptors: Computer Assisted Testing, Testing Problems, Scores, Regression (Statistics)
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Yu, Guoxing; He, Lianzhen; Rea-Dickins, Pauline; Kiely, Richard; Lu, Yanbin; Zhang, Jing; Zhang, Yan; Xu, Shasha; Fang, Lin – ETS Research Report Series, 2017
Language test preparation has often been studied within the consequential validity framework in relation to ethics, equity, fairness, and washback of assessment. The use of independent and integrated speaking tasks in the "TOEFL iBT"® test represents a significant development and innovation in assessing speaking ability in academic…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Oral Language
Peer reviewed Peer reviewed
Direct linkDirect link
van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012
While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…
Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kang, Che Chang – English Language Teaching, 2014
The study aimed at investigating TOEIC score distribution patterns and learner satisfaction in an intensive TOEIC course and drew implications for pedagogical practice. A one-group pre-test post-test experiment and a survey on learner satisfaction were conducted on Taiwanese college EFL students (n = 50) in a case study. Results showed that the…
Descriptors: Teaching Methods, Second Language Learning, Second Language Instruction, English (Second Language)
Dawson, Heather S. – ProQuest LLC, 2012
High-stakes testing has created challenges for teachers, administrators, parents, students, and other related education stakeholders in recent decades (Nichols & Berliner, 2007). While high-stakes tests have a long history (Ravitch, 2009) it was not until No Child Left Behind was signed into law in 2002 that the tests became law for most…
Descriptors: Beliefs, High Stakes Tests, Teacher Motivation, Teacher Attitudes
Peer reviewed Peer reviewed
Cureton, Edward E. – Educational and Psychological Measurement, 1971
A derivation of a formula for the stability coefficient is presented and discussed in terms of test reliability over time. (PR)
Descriptors: Error of Measurement, Raw Scores, Statistical Analysis, Test Reliability
Peer reviewed Peer reviewed
Boldt, R. F. – Educational and Psychological Measurement, 1974
Descriptors: Comparative Testing, Equated Scores, National Norms, Raw Scores
Andrulis, Richard S.; And Others – 1974
The purpose of this investigation was to establish the effects of repeaters on test equating. Since consideration was not given to repeaters in test equating, such as in the derivation of equations by Angoff (1971), the hypothetical effect needed to be established. A case study was examined which showed results on a test as expected; overall mean…
Descriptors: Cutting Scores, Equated Scores, Recall (Psychology), Retention (Psychology)
Barker, Pierce; Pelavin, Sol H. – 1976
This study was mounted to assess the validity of standard score transformations of raw test scores and test bias on the 1970 edition of the Metropolitan Achievement Test Battery, in the context of a controversial federally funded compensatory education program, the Educational Voucher Demonstration (EVD). On an individual level the validity of the…
Descriptors: Achievement Gains, Achievement Tests, Educationally Disadvantaged, Elementary Education
Echternacht, Gary; Plas, Jeanne M. – NCME, 1977
While most school districts believe they understand grade equivalent scores, teachers, parents, and measurement specialists frequently misinterpret this apparently simple statistical expression. Echternacht's article describes the construction, application, and interpretation of grade equivalent scores from the test publisher's perspective.…
Descriptors: Achievement Rating, Achievement Tests, Elementary Education, Grade Equivalent Scores
Peer reviewed Peer reviewed
Brennan, Robert L.; Lockwood, Robert E. – Applied Psychological Measurement, 1980
Generalizability theory is used to characterize and quantify expected variance in cutting scores and to compare the Nedelsky and Angoff procedures for establishing a cutting score. Results suggest that the restricted nature of the Nedelsky (inferred) probability scale may limit its applicability in certain contexts. (Author/BW)
Descriptors: Cutting Scores, Generalization, Statistical Analysis, Test Reliability
Sarvela, Paul D. – 1986
Four discrimination indices were compared, using score distributions which were normal, bimodal, and negatively skewed. The score distributions were systematically varied to represent the common circumstances of a military training situation using criterion-referenced mastery tests. Three 20-item tests were administered to 110 simulated subjects.…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Mastery Tests
Modu, Christopher C.; Stern, June – 1977
To assess the stability of the Scholastic Aptitude Test verbal score scale SAT--V, 1963 and 1973 forms of the SAT--V were administered in counterbalanced order to spaced samples of the same group. The 1973 scores were placed on the reporting scale used for the 1963 form. The experimentally derived scores on the 1963 scale were compared with their…
Descriptors: College Bound Students, College Entrance Examinations, Educational Problems, Educational Trends
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4