Publication Date
| In 2026 | 0 |
| Since 2025 | 222 |
| Since 2022 (last 5 years) | 1091 |
| Since 2017 (last 10 years) | 2601 |
| Since 2007 (last 20 years) | 4962 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 227 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2008
Response times on items can be used to improve item selection in adaptive testing provided that a probabilistic model for their distribution is available. In this research, the author used a hierarchical modeling framework with separate first-level models for the responses and response times and a second-level model for the distribution of the…
Descriptors: Reaction Time, Law Schools, Adaptive Testing, Item Analysis
Liu, Xiufeng; Fulmer, Gavin – Journal of Science Education and Technology, 2008
This article reports on an analysis of alignment between NY state core curricula and NY Regents tests in physics and chemistry. Both the curriculum and test were represented by a two dimensional table consisting of topics and cognitive demands. The cell values of the table were numbers of major understandings in the curriculum and points of test…
Descriptors: Core Curriculum, Test Items, Science Curriculum, Curriculum Development
Nardi, Emma – Assessment in Education: Principles, Policy & Practice, 2008
In response to Barry McGaw's article "The role of the OECD in international comparative studies of achievement", this contribution explores two issues which the author considers closely interrelated. The first issue concerns the quality of the items produced to assess reading skills in the OECD-PISA and OECD-SIALS studies and the…
Descriptors: Cultural Awareness, Foreign Countries, Comparative Analysis, Reading Skills
Wang, Tzu-Hua; Wang, Kuo-Hua; Huang, Shih-Chieh – Computers & Education, 2008
Teacher assessment literacy is a key factor in the success of teaching, but some studies concluded that teachers lack it. The aim of this research is to propose the ''Practicing, Reflecting and Revising with WATA system (P2R-WATA) Assessment Literacy Development Model'' for improving pre-service teacher assessment literacy. WATA system offers…
Descriptors: Preservice Teacher Education, Test Items, Item Analysis, Literacy
Hu, Huiqin; Rogers, W. Todd; Vukmirovic, Zarko – Applied Psychological Measurement, 2008
Common items with inconsistent b-parameter estimates may have a serious impact on item response theory (IRT)--based equating results. To find a better way to deal with the outlier common items with inconsistent b-parameters, the current study investigated the comparability of 10 variations of four IRT-based equating methods (i.e., concurrent…
Descriptors: Item Response Theory, Item Analysis, Computer Simulation, Equated Scores
Tristan, Agustin; Vidal, Rafael – Online Submission, 2007
Wright and Stone had proposed three features to assess the quality of the distribution of the items difficulties in a test, on the so called "most probable response map": line, stack and gap. Once a line is accepted as a design model for a test, gaps and stacks are practically eliminated, producing an evidence of the "scale…
Descriptors: Test Validity, Models, Difficulty Level, Test Items
Webb, Noreen M.; Herman, Joan L.; Webb, Norman L. – Educational Measurement: Issues and Practice, 2007
This article examines the role of reviewer agreement in judgments about alignment between tests and standards. We used case data from three state alignment studies to explore how different approaches to incorporating reviewer agreement changes alignment conclusions. The three case studies showed varying degrees of reviewer agreement about…
Descriptors: Test Items, Case Studies, Mathematics, Interrater Reliability
Butler, Andrew C.; Karpicke, Jeffrey D.; Roediger, Henry L., III – Journal of Experimental Psychology: Applied, 2007
Two experiments investigated how the type and timing of feedback influence learning from a multiple-choice test. First, participants read 12 prose passages, which covered various general knowledge topics (e.g., The Sun) and ranged between 280 and 300 words in length. Next, they took an initial six-alternative, multiple-choice test on information…
Descriptors: Test Items, Multiple Choice Tests, Prose, Test Results
Newman, Daniel A.; Hanges, Paul J.; Outtz, James L. – American Psychologist, 2007
According to Helms, "test fairness" is defined as "removal from test scores of systematic variance attributable to experiences of racial or cultural socialization." Some of Helms's reasoning is based on earlier work, which recommended that racial group or category variables be replaced entirely with individual-level constructs, to reflect racial…
Descriptors: Race, Socialization, Test Items, Construct Validity
Lu, Ying; Sireci, Stephen G. – Educational Measurement: Issues and Practice, 2007
Speededness refers to the situation where the time limits on a standardized test do not allow substantial numbers of examinees to fully consider all test items. When tests are not intended to measure speed of responding, speededness introduces a severe threat to the validity of interpretations based on test scores. In this article, we describe…
Descriptors: Test Items, Timed Tests, Standardized Tests, Test Validity
Ratliff, Bobby Kevin – ProQuest LLC, 2009
The purpose of this study was to determine (1) strategies students use when solving composition problems and the difficulties they encounter; (2) conceptions and/or misconceptions students have with respect to composition of functions; and (3) the effect of using dynamic visualization during instruction on students' understanding of composition of…
Descriptors: Test Items, Visualization, Concept Formation, Calculus
West, Emily Lincoln Ashbaugh – ProQuest LLC, 2009
Prior research across hundreds for introductory physics courses has demonstrated that traditional physics instruction does not generally lead to students learning physics concepts in a meaningful way, but that interactive-engagement physics courses do sometimes promote a great deal more student learning. In this work I analyze a reform effort in a…
Descriptors: Curriculum Design, Test Items, Mechanics (Physics), Peer Groups
Frey, Andreas; Seitz, Nicki-Nils – Studies in Educational Evaluation, 2009
The paper gives an overview of multidimensional adaptive testing (MAT) and evaluates its applicability in educational and psychological testing. The approach of Segall (1996) is described as a general framework for MAT. The main advantage of MAT is its capability to increase measurement efficiency. In simulation studies conceptualizing situations…
Descriptors: Psychological Testing, Adaptive Testing, Simulation, Evaluation Methods
Culpepper, Steven Andrew – Multivariate Behavioral Research, 2009
This study linked nonlinear profile analysis (NPA) of dichotomous responses with an existing family of item response theory models and generalized latent variable models (GLVM). The NPA method offers several benefits over previous internal profile analysis methods: (a) NPA is estimated with maximum likelihood in a GLVM framework rather than…
Descriptors: Profiles, Item Response Theory, Models, Maximum Likelihood Statistics
Peer reviewedSalehi, Mohammad; Rezaee, Abbas Ali – Indian Journal of Applied Linguistics, 2009
The study was conducted with 3,385 participants who took an English language proficiency test as a partial requirement for entering a PhD program in different fields of education. This test has three sections which are grammar, vocabulary and reading comprehension. To determine the construct validity of the test, a series of analyses were done.…
Descriptors: Reading Comprehension, Test Items, Construct Validity, Foreign Countries

Direct link
