DeBarger, Angela H.; DiBello, Louis; Minstrell, Jim; Stout, William; Pellegrino, James; Haertel, Geneva; Feng, Mingyu – Society for Research on Educational Effectiveness, 2011
The research design and team constitute a multidisciplinary attack on problems of educational and assessment design in physics instruction. Components of the research include: (a) an Evidence-Centered Design analysis of Diagnoser instructional materials and assessments that provides a view of the evidentiary coherence of the existing system; (b)…
Descriptors: Validity, Formative Evaluation, Physics, Science Instruction
Davey, Tim – Council of Chief State School Officers, 2011
Some brand names are used generically to describe an entire class of products that perform the same function. "Kleenex," "Xerox," "Thermos," and "Band-Aid" are good examples. The term "computerized adaptive testing" (CAT) is similar in that it is often applied uniformly across a diverse family of testing methods. Although the various members of…
Descriptors: Adaptive Testing, Computer Assisted Testing, Delivery Systems, Evaluation Methods
de Oliveira, Luciana C.; Cheng, Dazhi – Reading Matrix: An International Online Journal, 2011
This article explores how language and the multisemiotic nature of mathematics can present potential challenges for English language learners (ELLs). Based on two qualitative studies of the discourse of mathematics, we discuss some of the linguistic challenges of mathematics for ELLs in order to highlight the potential difficulties they may have…
Descriptors: Mathematics, Semiotics, Linguistics, English Language Learners
Pibal, Florian; Cesnik, Hermann S. – Practical Assessment, Research & Evaluation, 2011
When administering tests across grades, vertical scaling is often employed to place scores from different tests on a common overall scale so that test-takers' progress can be tracked. In order to be able to link the results across grades, however, common items are needed that are included in both test forms. In the literature there seems to be no…
Descriptors: Scaling, Test Items, Equated Scores, Reading Tests
Lee, Jihyun; Corter, James E. – Applied Psychological Measurement, 2011
Diagnosis of misconceptions or "bugs" in procedural skills is difficult because of their unstable nature. This study addresses this problem by proposing and evaluating a probability-based approach to the diagnosis of bugs in children's multicolumn subtraction performance using Bayesian networks. This approach assumes a causal network relating…
Descriptors: Misconceptions, Probability, Children, Subtraction
Butters, Roger B.; Walstad, William B. – Journal of Economic Education, 2011
Interest is growing at the precollege level in computer testing (CT) instead of paper-and-pencil testing (PT) for subjects in the school curriculum, including economics. Before economic educators adopt CT, a better understanding of its likely effects on test-taking behavior and performance compared with PT is needed. Using two volunteer student…
Descriptors: Computer Assisted Testing, Economics Education, Grade 8, Grade 9
Schuster, Christof; Yuan, Ke-Hai – Journal of Educational and Behavioral Statistics, 2011
Because of response disturbances such as guessing, cheating, or carelessness, item response models often can only approximate the "true" individual response probabilities. As a consequence, maximum-likelihood estimates of ability will be biased. Typically, the nature and extent to which response disturbances are present is unknown, and, therefore,…
Descriptors: Computation, Item Response Theory, Probability, Maximum Likelihood Statistics
Fisher, Anna V. – Cognition, 2011
Is processing of conceptual information as robust as processing of perceptual information early in development? Existing empirical evidence is insufficient to answer this question. To examine this issue, 3- to 5-year-old children were presented with a flexible categorization task, in which target items (e.g., an open red umbrella) shared category…
Descriptors: Test Items, Classification, Preschool Children, Cognitive Processes
Dorans, Neil J. – Harvard Educational Review, 2010
In his 2003 article in the "Harvard Educational Review" (HER), Freedle claimed that the SAT was both culturally and statistically biased and proposed a solution to ameliorate this bias. The author argued (Dorans, 2004a) that these claims were based on serious computational errors. In particular, he focused on how Freedle's table 2 was…
Descriptors: College Entrance Examinations, Test Bias, Test Items, Difficulty Level
Santelices, Maria Veronica; Wilson, Mark – Harvard Educational Review, 2010
In their paper "Unfair Treatment? The Case of Freedle, the SAT, and the Standardization Approach to Differential Item Functioning" (Santelices & Wilson, 2010), the authors studied claims of differential effects of the SAT on Latinos and African Americans through the methodology of differential item functioning (DIF). Previous…
Descriptors: College Entrance Examinations, Test Bias, Test Items, Difficulty Level
Voyer, Daniel; Doyle, Randi A. – Learning and Individual Differences, 2010
This study investigated gender differences on the Mental Rotations Test (MRT) as a function of item and response types. Accordingly, 86 male and 109 female undergraduate students completed the MRT without time limits. Responses were coded as reflecting two correct (CC), one correct and one wrong (CW), two wrong (WW), one correct and one blank…
Descriptors: Test Items, Gender Differences, Undergraduate Students, Spatial Ability
Suh, Yonghee; Grant, Leslie W. – History Teacher, 2014
Assessing students' historical understanding has been a long-standing challenge in history education. One of the widely used tools for accomplishing this task is the large-scale standardized test, the results of which are used as an indicator of student knowledge and skills in the social sciences/history. At the national level, the National…
Descriptors: National Competency Tests, History Instruction, Teaching Methods, Knowledge Level
Zumrawi, Abdel Azim; Bates, Simon P.; Schroeder, Marianne – Educational Research and Evaluation, 2014
This paper addresses the determination of statistically desirable response rates in students' surveys, with emphasis on assessing the effect of underlying variability in the student evaluation of teaching (SET). We discuss factors affecting the determination of adequate response rates and highlight challenges caused by non-response and lack of…
Descriptors: Inferences, Test Reliability, Response Rates (Questionnaires), Student Evaluation of Teacher Performance
Chiu, Chung-Yi; Jochman, Joseph; Fujikawa, Mayu; Strand, David; Cheing, Gladys; Lee, Gloria; Chan, Fong – Rehabilitation Research, Policy, and Education, 2014
Purpose: To examine the factorial structure of the "Coping Strategy Questionnaire"-24 (CSQ-24) in a sample of Canadians with chronic musculoskeletal pain. Method: The sample included 171 workers' compensation clients (50.9% men) recruited from outpatient rehabilitation facilities in Canada. Mean age of participants was 42.45 years (SD =…
Descriptors: Factor Analysis, Questionnaires, Coping, Measurement Techniques
Çil, Emine; Çepni, Salih – International Education Studies, 2014
International examination results have already influenced many countries to make radical reforms in their education systems. According to these results, countries have been categorized as having high, middle, or low achievement in education. Turkey has also taken these results quite seriously and started to investigate to what extent there are…
Descriptors: Science Achievement, Foreign Countries, Science Curriculum, Test Items

