Murphy, Daniel L.; Dodd, Barbara G.; Vaughn, Brandon K. – Applied Psychological Measurement, 2010
This study examined the performance of the maximum Fisher's information, the maximum posterior weighted information, and the minimum expected posterior variance methods for selecting items in a computerized adaptive testing system when the items were grouped in testlets. A simulation study compared the efficiency of ability estimation among the…
Descriptors: Simulation, Adaptive Testing, Item Analysis, Item Response Theory
Barrada, Juan Ramon; Olea, Julio; Ponsoda, Vicente; Abad, Francisco Jose – Applied Psychological Measurement, 2010
In a typical study comparing the relative efficiency of two item selection rules in computerized adaptive testing, the common result is that they simultaneously differ in accuracy and security, making it difficult to reach a conclusion on which is the more appropriate rule. This study proposes a strategy to conduct a global comparison of two or…
Descriptors: Test Items, Simulation, Adaptive Testing, Item Analysis
DeMars, Christine E.; Wise, Steven L. – International Journal of Testing, 2010
This investigation examined whether different rates of rapid guessing between groups could lead to detectable levels of differential item functioning (DIF) in situations where the item parameters were the same for both groups. Two simulation studies were designed to explore this possibility. The groups in Study 1 were simulated to reflect…
Descriptors: Guessing (Tests), Test Bias, Motivation, Gender Differences
Michaelides, Michalis P. – Applied Psychological Measurement, 2010
The delta-plot method (Angoff, 1972) is a graphical technique used in the context of test equating for identifying common items with aberrant changes in their item difficulties across administrations or alternate forms. This brief research report explores the effects on equated aggregate scores when delta-plot outliers are either retained in or…
Descriptors: Test Items, Behavior Problems, Measurement, Mathematics Instruction
Canel, Azize Nilgun – Educational Sciences: Theory and Practice, 2013
In this study, the process of developing the Marital Satisfaction Scale (MSS) aiming to support studies in the field of marital satisfaction and to obtain information about couples in a short time through psychological counseling is discussed. The scale including 101 yes-no items aiming to reveal couples' opinions about their marriages was…
Descriptors: Measures (Individuals), Marital Satisfaction, Parents, Child Rearing
Hsu, Ya-Wen; Lu, Frank Jing-Horng – Educational Gerontology, 2013
Physical self-concept plays a central role in older adults' physical health, mental health and psychological well-being; however, little attention has been paid to the underlying dimensions of physical self-concept in the elderly. The purpose of this study was to develop and validate a new measurement for older adults. First, a qualitative study…
Descriptors: Test Construction, Self Concept, Older Adults, Physical Health
Wilhelmsen, Cheryl A. – ProQuest LLC, 2013
The purpose of this study is to identify the important constructs and their key indicators that are to be included on an instrument developed to measure the engineering design process and outcome of students in high schools that use the Project Lead the Way and Engineering by Design curriculums. Several pre-engineering curriculums are used in high…
Descriptors: High School Students, Curriculum, Engineering Education, Outcomes of Education
Ilich, Maria O. – ProQuest LLC, 2013
Psychometricians and test developers evaluate standardized tests for potential bias against groups of test-takers by using differential item functioning (DIF). English language learners (ELLs) are a diverse group of students whose native language is not English. While they are still learning the English language, they must take their standardized…
Descriptors: Spanish Speaking, English Language Learners, Grade 5, Science Tests
Dolan, Robert P.; Burling, Kelly; Harms, Michael; Strain-Seymour, Ellen; Way, Walter; Rose, David H. – Pearson, 2013
The increased capabilities of digital technologies offer new opportunities to evaluate students' deeper knowledge and skills and to assess constructs that are difficult to measure using traditional methods. Such assessments can also incorporate tools and interfaces that improve accessibility for diverse students, as well as inadvertently…
Descriptors: Educational Technology, Technology Uses in Education, Access to Education, Evaluation Methods
Williams-Bonds, Carmen – ProQuest LLC, 2013
The purpose of this study was to compare three groups (JROTC students, student athletes, and other students) to determine if there were differences in academic achievement. Gaining an understanding of the necessary skills required to become academically successful and make healthy life choices could provide educators working within an urban…
Descriptors: Comparative Analysis, Military Training, High School Students, Urban Schools
Foy, Pierre, Ed.; Drucker, Kathleen T., Ed. – International Association for the Evaluation of Educational Achievement, 2013
This supplement describes national adaptations made to the international version of the PIRLS/prePIRLS 2011 background questionnaires. This information provides users with a guide to evaluate the availability of internationally comparable data for use in secondary analyses involving the PIRLS/prePIRLS 2011 background variables. Background…
Descriptors: Questionnaires, Databases, Media Adaptation, Technology Transfer
Pižorn, Karmen; Moe, Eli – Center for Educational Policy Studies Journal, 2012
This article is a validation study of two national large-scale tests that measure the language proficiency of 11- to 12-year-old English learners in Norway and Slovenia. Following the example of Alderson and Banerjee (2008), the authors have employed the EALTA guidelines for good practice to validate the tests, and to formulate major…
Descriptors: English (Second Language), Second Language Learning, Guidelines, Test Construction
Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we describe the results of a study of mathematics items written to align with the Common Core State Standards (CCSS) in grades 6-8. In each grade, CCSS items were organized into forms, and the reliability of these forms was evaluated along with an experimental form including items aligned with the National Council of…
Descriptors: Curriculum Based Assessment, Mathematics Tests, Academic Standards, State Standards
Reshetar, Rosemary – College Board, 2012
On 9/13/12, the Workshop on Developing Assessments to Meet the Goals of the 2012 Framework for K-12 Science Education was held at the National Academies of Science. The workshop was organized and led by the NRC Committee on Developing Assessments of Science Proficiency in K-12 (co-chaired by James Pellegrino and Mark Wilson) and targeted to state…
Descriptors: Advanced Placement, Biology, Science Tests, Science Education
Razi, Salim – Online Submission, 2012
This study presents the processes of developing a reading test and establishing its reliability and validity through an integrative approach, as conventional reliability and validity measures only superficially reveal the difficulty of a reading test. In this respect, analysing the vocabulary frequency of the test is regarded as a more eligible way…
Descriptors: Foreign Countries, Undergraduate Students, Reading Tests, Test Validity

