Publication Date
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Calik, Muammer; Ayas, Alipasa; Coll, Richard K. – International Journal of Science and Mathematics Education, 2009
This paper reports on an investigation into the use of an analogy activity and seeks to provide evidence of whether the activity enables students to change alternative conceptions towards views more in accord with scientific views for aspects of solution chemistry. We were also interested in how robust any change was and whether these changes in…
Descriptors: Test Items, Chemistry, Long Term Memory, Foreign Countries
Jordan, Sally; Mitchell, Tom – British Journal of Educational Technology, 2009
A natural language based system has been used to author and mark short-answer free-text assessment tasks. Students attempt the questions online and are given tailored and relatively detailed feedback on incorrect and incomplete responses, and have the opportunity to repeat the task immediately so as to learn from the feedback provided. The answer…
Descriptors: Feedback (Response), Test Items, Natural Language Processing, Teaching Methods
Siddiek, Ahmed Gumaa – English Language Teaching, 2010
Examinations--among other things--are tools of quality control by which we can measure the attainment of the national educational goals. High-quality examinations are means of evaluation that can help teachers modify their teaching techniques, as well as helping learners adjust their learning strategies. Examinations are also benchmarks that can…
Descriptors: Foreign Countries, Student Certification, Questionnaires, Test Validity
Rakes, Christopher R. – ProQuest LLC, 2010
In this study, the author examined the relationship of probability misconceptions to algebra, geometry, and rational number misconceptions and investigated the potential of probability instruction as an intervention to address misconceptions in all 4 content areas. Through a review of literature, 5 fundamental concepts were identified that, if…
Descriptors: Control Groups, Fundamental Concepts, Intervention, Structural Equation Models
Sawchuk, Stephen – Education Digest: Essential Readings Condensed for Quick Review, 2010
Most experts in the testing community have presumed that the $350 million promised by the U.S. Department of Education to support common assessments would promote those that made greater use of open-ended items capable of measuring higher-order critical-thinking skills. But as measurement experts consider the multitude of possibilities for an…
Descriptors: Educational Quality, Test Items, Comparative Analysis, Multiple Choice Tests
Walsh, Kerryann; Rassafiani, Mehdi; Mathews, Ben; Farrell, Ann; Butler, Des – Journal of Child Sexual Abuse, 2010
This paper details a systematic literature review identifying problems in extant research relating to teachers' attitudes toward reporting child sexual abuse and offers a model for new attitude scale development and testing. Scale development comprised a five-phase process grounded in contemporary attitude theories, including (a) developing the…
Descriptors: Sexual Abuse, Child Abuse, Focus Groups, Content Validity
Bernardo, Alejandro S. – Journal on English Language Teaching, 2011
This study examined the "communicativeness" of 22 English language tests designed and administered by 22 English instructors from 22 different colleges and universities in the Philippines. Its key objective was to answer the question "How communicative are the language tests used in assessing students' competence (knowledge of the…
Descriptors: Foreign Countries, Communicative Competence (Languages), Case Studies, English
Circelli, Michelle; Curtis, David; Perkins, Kate – National Centre for Vocational Education Research (NCVER), 2011
Language, literacy and numeracy are necessary for greater workforce participation, productivity and social inclusion. Being able to measure the level of proficiency in these skills, and any changes in the level of skills, is important for getting a sense of how well language, literacy and numeracy programs are working. Two measurement tools used…
Descriptors: Foreign Countries, Adult Literacy, Surveys, Educational Assessment
Kaliski, Pamela; Huff, Kristen; Barry, Carol – College Board, 2011
For educational achievement tests that employ multiple-choice (MC) items and aim to reliably classify students into performance categories, it is critical to design MC items that are capable of discriminating student performance according to the stated achievement levels. This is accomplished, in part, by clearly understanding how item design…
Descriptors: Alignment (Education), Academic Achievement, Expertise, Evaluative Thinking
Taylor, Catherine S.; Lee, Yoonsun – Educational Assessment, 2011
This article presents a study of ethnic Differential Item Functioning (DIF) for 4th-, 7th-, and 10th-grade reading items on a state criterion-referenced achievement test. The tests, administered 1997 to 2001, were composed of multiple-choice and constructed-response items. Item performance by focal groups (i.e., students from Asian/Pacific Island,…
Descriptors: Test Bias, Test Items, Pacific Islanders, American Indians
Puhan, Gautam; von Davier, Alina; Gupta, Shaloo – ETS Research Report Series, 2008
Equating under the external anchor design is frequently conducted using scaled scores on the anchor test. However, scaled scores often lead to the unique problem of creating zero frequencies in the score distribution because there may not always be a one-to-one correspondence between raw and scaled scores. For example, raw scores of 17 and 18 may…
Descriptors: Equated Scores, Test Items, Raw Scores, Statistical Analysis
Tristan-Lopez, Agustin; Mendoza-Gonzalez, Liliana; Diaz-Gutierrez, Maria Antonieta; Flores-Vazquez, Gustavo; Solis-Gonzalez, Roberto; Canales-Sanchez, Damian; Morelos-Mora, Placido; de la C. Hernandez, Yesenia – Online Submission, 2008
The international OECD PISA [Programme for International Student Assessment] 2006 test focused on the science performance of 15-year-old students. The unsatisfactory results from Mexico were submitted to analysis, including multilevel models, to explain the origin of their deficiencies. It was clear that a differential functioning behavior or a…
Descriptors: Science Achievement, Science Tests, Test Validity, Test Construction
Walker, Cindy M.; Zhang, Bo; Surber, John – Applied Measurement in Education, 2008
Many teachers and curriculum specialists claim that the reading demand of many mathematics items is so great that students do not perform well on mathematics tests, even though they have a good understanding of mathematics. The purpose of this research was to test this claim empirically. This analysis was accomplished by considering examinees that…
Descriptors: Test Items, Construct Validity, Test Validity, Mathematics Tests
Lin, Chuan-Ju – Journal of Technology, Learning, and Assessment, 2008
The automated assembly of alternate test forms for online delivery provides an alternative to computer-administered, fixed test forms, or computerized-adaptive tests when a testing program migrates from paper/pencil testing to computer-based testing. The weighted deviations model (WDM) heuristic is particularly promising for automated test assembly…
Descriptors: Item Response Theory, Test Theory, Comparative Analysis, Computer Assisted Testing
Montgomery, Janine Marie; Newton, Brendan; Smith, Christiane – Journal of Psychoeducational Assessment, 2008
The Gilliam Autism Rating Scale-Second Edition (GARS-2) is a screening tool for autism spectrum disorders for individuals between the ages of 3 and 22. It was designed to help differentiate those with autism from those with severe behavioral disorders as well as from those who are typically developing. It is a norm-referenced instrument that…
Descriptors: Autism, Rating Scales, Test Reviews, Norm Referenced Tests

