NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 256 to 270 of 1,166 results Save | Export
Ryan, Joseph; Brockmann, Frank – Council of Chief State School Officers, 2009
Equating is an essential tool in educational assessment due the critical role it plays in several key areas: establishing validity across forms and years; fairness; test security; and, increasingly, continuity in programs that release items or require ongoing development. Although the practice of equating is rooted in long standing practices that…
Descriptors: Equated Scores, Test Theory, Item Response Theory, Educational Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
van der Linden, Wim J. – Journal of Educational Measurement, 2009
Two different traditions of response-time (RT) modeling are reviewed: the tradition of distinct models for RTs and responses, and the tradition of model integration in which RTs are incorporated in response models or the other way around. Several conceptual issues underlying both traditions are made explicit and analyzed for their consequences. We…
Descriptors: Test Items, Models, Reaction Time, Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Scherman, Vanessa; Howie, Sarah J.; Bosker, Roel J. – Educational Research and Evaluation, 2011
In information-rich environments, schools are often presented with a myriad of data from which decisions need to be made. The use of the information on a classroom level may be facilitated if performance could be described in terms of levels of proficiency or benchmarks. The aim of this article is to explore benchmarks using data from a monitoring…
Descriptors: Standard Setting, Foreign Countries, Grade 8, Ability
Peer reviewed Peer reviewed
Direct linkDirect link
McGrath, Helen; O'Toole, Thomas – European Journal of Training and Development, 2012
Purpose: The main aim of this paper is to develop guidelines on the critical issues to consider in research design in an action research (AR) environment for SME network capability development. Design/methodology/approach: The issues in research design for AR studies are developed from the authors' experience in running learning sets but, in…
Descriptors: Research Design, Action Research, Research Methodology, Data Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Kettler, Ryan J.; Dickenson, Tammiee S.; Bennett, Heather L.; Morgan, Grant B.; Gilmore, Joanna A.; Beddow, Peter A.; Swaffield, Suzanne; Turner, Linda; Herrera, Bill; Turner, Charlene; Palmer, Porter W. – Exceptional Children, 2012
This study was inspired by the final regulations for the No Child Left Behind Act (NCLB) indicating that each state has the option to develop a new assessment for students whose disabilities have kept them from obtaining proficiency. Sets of high school science achievement items were enhanced for the new test. A 3-by-2, within subjects,…
Descriptors: Accessibility (for Disabled), Achievement Tests, Science Achievement, Testing Accommodations
Peer reviewed Peer reviewed
Direct linkDirect link
He, Qingping; Opposs, Dennis – Educational Research and Evaluation, 2012
National tests, public examinations, and vocational qualifications in England are used for a variety of purposes, including the certification of individual learners in different subject areas and the accountability of individual professionals and institutions. However, there has been ongoing debate about the reliability and validity of their…
Descriptors: Qualifications, Evidence, National Competency Tests, Foreign Countries
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Ilyas, Bhutto Muhammad; Rawat, Khalid Jamil; Bhatti, Muhammad Tariq; Malik, Najeeb – International Journal of Instruction, 2013
It is a bitter reality that the curricula and traditional pedagogy prevailing in public schools of Pakistan in general and Sindh in particular do not incorporate the algebraic concepts properly. Both the content and the presentation therein cannot be considered up to the mark, thereby making "Algebra" a tough and dry subject. This…
Descriptors: Algebra, Public Schools, Foreign Countries, Control Groups
Haberman, Shelby J. – Educational Testing Service, 2011
Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…
Descriptors: Writing Tests, Scoring, Essays, Language Tests
Deng, Nina – ProQuest LLC, 2011
Three decision consistency and accuracy (DC/DA) methods, the Livingston and Lewis (LL) method, LEE method, and the Hambleton and Han (HH) method, were evaluated. The purposes of the study were: (1) to evaluate the accuracy and robustness of these methods, especially when their assumptions were not well satisfied, (2) to investigate the "true"…
Descriptors: Item Response Theory, Test Theory, Computation, Classification
Peer reviewed Peer reviewed
Direct linkDirect link
Parker, Richard I.; Vannest, Kimberly J.; Davis, John L.; Clemens, Nathan H. – Journal of Special Education, 2012
Within a response to intervention model, educators increasingly use progress monitoring (PM) to support medium- to high-stakes decisions for individual students. For PM to serve these more demanding decisions requires more careful consideration of measurement error. That error should be calculated within a fixed linear regression model rather than…
Descriptors: Measurement, Computation, Response to Intervention, Regression (Statistics)
Peer reviewed Peer reviewed
Direct linkDirect link
Beddow, Peter A. – International Journal of Disability, Development and Education, 2012
In the arena of educational testing, accessibility refers to the degree to which students are given the opportunity to participate in and engage a test. Accessibility theory is a model for examining the interactions between the test-taker and the test itself and defining how they may decrease some students' access to the test event, ultimately…
Descriptors: Test Results, Test Items, Educational Testing, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Wallace, Colin S.; Bailey, Janelle M. – Astronomy Education Review, 2010
Although concept inventories are among the most frequently used tools in the physics and astronomy education communities, they are rarely evaluated using item response theory (IRT). When IRT models fit the data, they offer sample-independent estimates of item and person parameters. IRT may also provide a way to measure students' learning gains…
Descriptors: Astronomy, Science Tests, Multiple Choice Tests, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Andrich, David; Kreiner, Svend – Applied Psychological Measurement, 2010
Models of modern test theory imply statistical independence among responses, generally referred to as "local independence." One violation of local independence occurs when the response to one item governs the response to a subsequent item. Expanding on a formulation of this kind of violation as a process in the dichotomous Rasch model,…
Descriptors: Test Theory, Item Response Theory, Test Items, Correlation
Peer reviewed Peer reviewed
Direct linkDirect link
Kaufman, Alan S. – Journal of Psychoeducational Assessment, 2010
Flynn wrote a book devoted to the Flynn effect, featuring his theoretical explanation of why the intelligence of worldwide populations has apparently increased from generation to generation. The essence of his theorizing is that because of the societal impact of scientific technology, people of today are much more guided by abstract, rather than…
Descriptors: Intelligence Tests, Age Differences, Change, Test Norms
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Guler, Nese; Gelbal, Selahattin – Educational Sciences: Theory and Practice, 2010
In this study, the Classical test theory and generalizability theory were used for determination to reliability of scores obtained from measurement tool of mathematics success. 24 open-ended mathematics question of the TIMSS-1999 was applied to 203 students in 2007-spring semester. Internal consistency of scores was found as 0.92. For…
Descriptors: Generalizability Theory, Test Theory, Test Reliability, Interrater Reliability
Pages: 1  |  ...  |  14  |  15  |  16  |  17  |  18  |  19  |  20  |  21  |  22  |  ...  |  78