NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 4,531 to 4,545 of 9,533 results Save | Export
Sawchuk, Stephen – Education Digest: Essential Readings Condensed for Quick Review, 2010
Most experts in the testing community have presumed that the $350 million promised by the U.S. Department of Education to support common assessments would promote those that made greater use of open-ended items capable of measuring higher-order critical-thinking skills. But as measurement experts consider the multitude of possibilities for an…
Descriptors: Educational Quality, Test Items, Comparative Analysis, Multiple Choice Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Walsh, Kerryann; Rassafiani, Mehdi; Mathews, Ben; Farrell, Ann; Butler, Des – Journal of Child Sexual Abuse, 2010
This paper details a systematic literature review identifying problems in extant research relating to teachers' attitudes toward reporting child sexual abuse and offers a model for new attitude scale development and testing. Scale development comprised a five-phase process grounded in contemporary attitude theories, including (a) developing the…
Descriptors: Sexual Abuse, Child Abuse, Focus Groups, Content Validity
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Puhan, Gautam; vonDavier, Alina; Gupta, Shaloo – ETS Research Report Series, 2008
Equating under the external anchor design is frequently conducted using scaled scores on the anchor test. However, scaled scores often lead to the unique problem of creating zero frequencies in the score distribution because there may not always be a one-to-one correspondence between raw and scaled scores. For example, raw scores of 17 and 18 may…
Descriptors: Equated Scores, Test Items, Raw Scores, Statistical Analysis
Tristan-Lopez, Agustin; Mendoza-Gonzalez, Liliana; Diaz-Gutierrez, Maria Antonieta; Flores-Vazquez, Gustavo; Solis-Gonzalez, Roberto; Canales-Sanchez, Damian; Morelos-Mora, Placido; de la C. Hernandez, Yesenia – Online Submission, 2008
The international OECD PISA [Programme for International Assessment] 2006 test focused on the performance of Sciences of 15 years old students. The unsatisfactory results from Mexico were submitted to analysis, including multilevel models, to explain the origin of their deficiencies. It was clear that a differential functioning behavior or a…
Descriptors: Science Achievement, Science Tests, Test Validity, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Walker, Cindy M.; Zhang, Bo; Surber, John – Applied Measurement in Education, 2008
Many teachers and curriculum specialists claim that the reading demand of many mathematics items is so great that students do not perform well on mathematics tests, even though they have a good understanding of mathematics. The purpose of this research was to test this claim empirically. This analysis was accomplished by considering examinees that…
Descriptors: Test Items, Construct Validity, Test Validity, Mathematics Tests
Lin, Chuan-Ju – Journal of Technology, Learning, and Assessment, 2008
The automated assembly of alternate test forms for online delivery provides an alternative to computer-administered, fixed test forms, or computerized-adaptive tests when a testing program migrates from paper/pencil testing to computer-based testing. The weighted deviations model (WDM) heuristic particularly promising for automated test assembly…
Descriptors: Item Response Theory, Test Theory, Comparative Analysis, Computer Assisted Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Montgomery, Janine Marie; Newton, Brendan; Smith, Christiane – Journal of Psychoeducational Assessment, 2008
The Gilliam Autism Rating Scale-Second Edition (GARS-2) is a screening tool for autism spectrum disorders for individuals between the ages of 3 and 22. It was designed to help differentiate those with autism from those with severe behavioral disorders as well as from those who are typically developing. It is a norm-referenced instrument that…
Descriptors: Autism, Rating Scales, Test Reviews, Norm Referenced Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Roberts, James S. – Applied Psychological Measurement, 2008
Orlando and Thissen (2000) developed an item fit statistic for binary item response theory (IRT) models known as S-X[superscript 2]. This article generalizes their statistic to polytomous unfolding models. Four alternative formulations of S-X[superscript 2] are developed for the generalized graded unfolding model (GGUM). The GGUM is a…
Descriptors: Item Response Theory, Goodness of Fit, Test Items, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Wasylkiw, Louise; Tomes, Jennifer L.; Smith, Francine – Journal of Experimental Education, 2008
In 3 studies, the authors examined the prevalence and effects of a testing strategy whereby they gave a set of items to participants in advance and subsequently tested them on a portion of those items (i.e., subset testing). In a survey of university instructors, Study 1 showed that subset testing is a commonly used testing strategy. In this…
Descriptors: Undergraduate Students, Incidence, Definitions, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Cui, Zhongmin; Kolen, Michael J. – Applied Psychological Measurement, 2008
This article considers two methods of estimating standard errors of equipercentile equating: the parametric bootstrap method and the nonparametric bootstrap method. Using a simulation study, these two methods are compared under three sample sizes (300, 1,000, and 3,000), for two test content areas (the Iowa Tests of Basic Skills Maps and Diagrams…
Descriptors: Test Length, Test Content, Simulation, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Tal, Ilanit R.; Akers, Katherine G.; Hodge, Gordon K. – Teaching of Psychology, 2008
To deter cheating, teachers commonly use exams printed on differently colored paper or with varied question orders. Previous studies, however, reported that paper color and question order affect exam performance and suggested that teachers should adjust students' scores accordingly and discontinue the use of alternate exam forms. We conducted 2…
Descriptors: Evaluation Methods, Student Evaluation, Color, Visual Environment
Bernardo, Alejandro S. – Journal on English Language Teaching, 2011
This study examined the "communicativeness" of 22 English language tests designed and administered by 22 English instructors from 22 different colleges and universities in the Philippines. Its key objective was to answer the question "How communicative are the language tests used in assessing students' competence (knowledge of the…
Descriptors: Foreign Countries, Communicative Competence (Languages), Case Studies, English
Circelli, Michelle; Curtis, David; Perkins, Kate – National Centre for Vocational Education Research (NCVER), 2011
Language, literacy and numeracy are necessary for greater workforce participation, productivity and social inclusion. Being able to measure the level of proficiency in these skills, and any changes in the level of skills, is important for getting a sense of how well language, literacy and numeracy programs are working. Two measurement tools used…
Descriptors: Foreign Countries, Adult Literacy, Surveys, Educational Assessment
Kaliski, Pamela; Huff, Kristen; Barry, Carol – College Board, 2011
For educational achievement tests that employ multiple-choice (MC) items and aim to reliably classify students into performance categories, it is critical to design MC items that are capable of discriminating student performance according to the stated achievement levels. This is accomplished, in part, by clearly understanding how item design…
Descriptors: Alignment (Education), Academic Achievement, Expertise, Evaluative Thinking
Peer reviewed Peer reviewed
Direct linkDirect link
Taylor, Catherine S.; Lee, Yoonsun – Educational Assessment, 2011
This article presents a study of ethnic Differential Item Functioning (DIF) for 4th-, 7th-, and 10th-grade reading items on a state criterion-referenced achievement test. The tests, administered 1997 to 2001, were composed of multiple-choice and constructed-response items. Item performance by focal groups (i.e., students from Asian/Pacific Island,…
Descriptors: Test Bias, Test Items, Pacific Islanders, American Indians
Pages: 1  |  ...  |  299  |  300  |  301  |  302  |  303  |  304  |  305  |  306  |  307  |  ...  |  636