Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Asim, Alice E.; Ekuri, Emmanuel E.; Eni, Eni I. – Research in Education, 2013
Large class size is an issue in testing at all levels of Education. As a panacea to this, multiple choice test formats has become very popular. This case study was designed to diagnose pre-service teachers' competency in constructing questions (IQT); direct questions (DQT); and best answer (BAT) varieties of multiple choice items. Subjects were 88…
Descriptors: Test Items, Test Construction, Foreign Countries, Class Size
Jiao, Hong; Wang, Shudong; He, Wei – Journal of Educational Measurement, 2013
This study demonstrated the equivalence between the Rasch testlet model and the three-level one-parameter testlet model and explored the Markov Chain Monte Carlo (MCMC) method for model parameter estimation in WINBUGS. The estimation accuracy from the MCMC method was compared with those from the marginalized maximum likelihood estimation (MMLE)…
Descriptors: Computation, Item Response Theory, Models, Monte Carlo Methods
Shea, Christine A. – ProQuest LLC, 2013
The purpose of this study was to determine whether an eighth grade state-level math assessment contained items that function differentially (DIF) for English Learner students (EL) as compared to English Only students (EO) and if so, what factors might have caused DIF. To determine this, Differential Item Functioning (DIF) analysis was employed.…
Descriptors: Item Response Theory, English Language Learners, Grade 8, Mathematics Tests
Porter, Andrew; Polikoff, Morgan S.; Barghaus, Katherine M.; Yang, Rui – Educational Researcher, 2013
We describe an innovative automated test construction algorithm for building aligned achievement tests. By incorporating the algorithm into the test construction process, along with other test construction procedures for building reliable and unbiased assessments, the result is much more valid tests than result from current test construction…
Descriptors: Achievement Tests, Automation, Test Construction, Alignment (Education)
Talento-Miller, Eileen; Guo, Fanmin; Han, Kyung T. – International Journal of Testing, 2013
When power tests include a time limit, it is important to assess the possibility of speededness for examinees. Past research on differential speededness has examined gender and ethnic subgroups in the United States on paper and pencil tests. When considering the needs of a global audience, research regarding different native language speakers is…
Descriptors: Adaptive Testing, Computer Assisted Testing, English, Scores
Çibik, Ayse Sert – International Journal of Environmental and Science Education, 2016
The aim of this study is to compare the change of pre-service science teachers' views about the nature of scientific knowledge through Project-Based History and Nature of Science training and Conventional Method. The sample of the study consists of two groups of 3rd grade undergraduate students attending teacher preparation program of science…
Descriptors: Scientific Principles, Scientific Literacy, Scientific Concepts, Quasiexperimental Design
Billingham, Chase M.; Kimelberg, Shelley McDonough – Journal of Education Policy, 2016
The meaning, measurement, and implications of "public opinion" have long been a source of debate. In this paper, we examine the extent to which the educational priorities of elites in the US reflect the educational priorities of the American public. To do so, we focus on one particular segment of the education policy-making elite --…
Descriptors: Public Opinion, Educational Attitudes, Surveys, National Surveys
Perry, Rebecca R.; Finkelstein, Neal D.; Seago, Nanette; Heredia, Alberto; Sobolew-Shubin, Sandy; Carroll, Cathy – WestEd, 2015
Math in Common® (MiC) is a five-year initiative that supports a formal network of 10 California school districts as they implement the Common Core State Standards in Mathematics (CCSS-M) across grades K-8. In spring 2015, WestEd administered surveys to understand the perspectives on Common Core State Standards-Mathematics (CCSS-M) implementation…
Descriptors: Mathematics Education, Curriculum Implementation, Formative Evaluation, Teacher Evaluation
DeBarger, Angela H.; DiBello, Louis; Minstrell, Jim; Stout, William; Pellegrino, James; Haertel, Geneva; Feng, Mingyu – Society for Research on Educational Effectiveness, 2011
The research design and team constitute a multidisciplinary attack on problems of educational and assessment design in physics instruction. Components of the research include: (a) an Evidence-Centered Design analysis of Diagnoser instructional materials and assessments that provides a view of the evidentiary coherence of the existing system; (b)…
Descriptors: Validity, Formative Evaluation, Physics, Science Instruction
Davey, Tim – Council of Chief State School Officers, 2011
Some brand names are used generically to describe an entire class of products that perform the same function. "Kleenex," "Xerox," "Thermos," and "Band-Aid" are good examples. The term "computerized adaptive testing" (CAT) is similar in that it is often applied uniformly across a diverse family of testing methods. Although the various members of…
Descriptors: Adaptive Testing, Computer Assisted Testing, Delivery Systems, Evaluation Methods
de Oliveira, Luciana C.; Cheng, Dazhi – Reading Matrix: An International Online Journal, 2011
This article explores how language and the multisemiotic nature of mathematics can present potential challenges for English language learners (ELLs). Based on two qualitative studies of the discourse of mathematics, we discuss some of the linguistic challenges of mathematics for ELLs in order to highlight the potential difficulties they may have…
Descriptors: Mathematics, Semiotics, Linguistics, English Language Learners
Pibal, Florian; Cesnik, Hermann S. – Practical Assessment, Research & Evaluation, 2011
When administering tests across grades, vertical scaling is often employed to place scores from different tests on a common overall scale so that test-takers' progress can be tracked. In order to be able to link the results across grades, however, common items are needed that are included in both test forms. In the literature there seems to be no…
Descriptors: Scaling, Test Items, Equated Scores, Reading Tests
Lee, Jihyun; Corter, James E. – Applied Psychological Measurement, 2011
Diagnosis of misconceptions or "bugs" in procedural skills is difficult because of their unstable nature. This study addresses this problem by proposing and evaluating a probability-based approach to the diagnosis of bugs in children's multicolumn subtraction performance using Bayesian networks. This approach assumes a causal network relating…
Descriptors: Misconceptions, Probability, Children, Subtraction
Butters, Roger B.; Walstad, William B. – Journal of Economic Education, 2011
Interest is growing at the precollege level in computer testing (CT) instead of paper-and-pencil testing (PT) for subjects in the school curriculum, including economics. Before economic educators adopt CT, a better understanding of its likely effects on test-taking behavior and performance compared with PT is needed. Using two volunteer student…
Descriptors: Computer Assisted Testing, Economics Education, Grade 8, Grade 9
Schuster, Christof; Yuan, Ke-Hai – Journal of Educational and Behavioral Statistics, 2011
Because of response disturbances such as guessing, cheating, or carelessness, item response models often can only approximate the "true" individual response probabilities. As a consequence, maximum-likelihood estimates of ability will be biased. Typically, the nature and extent to which response disturbances are present is unknown, and, therefore,…
Descriptors: Computation, Item Response Theory, Probability, Maximum Likelihood Statistics

Peer reviewed
Direct link
