Publication Date
| In 2026 | 0 |
| Since 2025 | 200 |
| Since 2022 (last 5 years) | 1070 |
| Since 2017 (last 10 years) | 2580 |
| Since 2007 (last 20 years) | 4941 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Socha, Alan; DeMars, Christine E. – Educational and Psychological Measurement, 2013
Modeling multidimensional test data with a unidimensional model can result in serious statistical errors, such as bias in item parameter estimates. Many methods exist for assessing the dimensionality of a test. The current study focused on DIMTEST. Using simulated data, the effects of sample size splitting for use with the ATFIND procedure for…
Descriptors: Sample Size, Test Length, Correlation, Test Format
Wang, Chun; Fan, Zhewen; Chang, Hua-Hua; Douglas, Jeffrey A. – Journal of Educational and Behavioral Statistics, 2013
The item response times (RTs) collected from computerized testing represent an underutilized type of information about items and examinees. In addition to knowing the examinees' responses to each item, we can investigate the amount of time examinees spend on each item. Current models for RTs mainly focus on parametric models, which have the…
Descriptors: Reaction Time, Computer Assisted Testing, Test Items, Accuracy
Valdez, Alfred – International Journal of Higher Education, 2013
Metacognitive monitoring processes have been shown to be critical determinants of human learning. Metacognitive monitoring consist of various knowledge estimates that enable learners to engage in self-regulatory processes important for both the acquisition of knowledge and the monitoring of one's knowledge when engaged in assessment. This study…
Descriptors: Metacognition, Accuracy, Correlation, Validity
Hsieh, Feng-Jui – International Journal of Science and Mathematics Education, 2013
This paper discusses different conceptual frameworks for measuring mathematics pedagogical content knowledge (MPCK) in international comparison studies. Two large-scale international comparative studies, "Mathematics Teaching in the Twenty-First Century" (MT21; Schmidt et al., 2011) and the "Teacher Education and Development Study…
Descriptors: Pedagogical Content Knowledge, Mathematics Teachers, Mathematics Instruction, Foreign Countries
Hess, Brian J.; Johnston, Mary M.; Lipner, Rebecca S. – International Journal of Testing, 2013
Current research on examination response time has focused on tests comprised of traditional multiple-choice items. Consequently, the impact of other innovative or complex item formats on examinee response time is not understood. The present study used multilevel growth modeling to investigate examinee characteristics associated with response time…
Descriptors: Test Items, Test Format, Reaction Time, Individual Characteristics
Senocak, Erdal; Samarapungavan, Ala; Aksoy, Pinar; Tosun, Cemal – Educational Sciences: Theory and Practice, 2013
The aim of this study was to develop a valid and reliable instrument to measure Turkish kindergarten students' understandings of some science concepts and scientific inquiry processes which are grounded in the Turkish Preschool Curriculum. The sample of the study was 371 kindergarten students, 12 Subject Area Experts (SAE), and 7 Turkish Language…
Descriptors: Foreign Countries, Kindergarten, Scientific Concepts, Inquiry
Miguel, Jose P.; Silva, Jose T.; Prieto, Gerardo – Journal of Vocational Behavior, 2013
The present study analyzes the psychometric properties of the Career Decision Self-Efficacy Scale-Short Form (CDSE-SF) in a sample of Portuguese secondary education students using the Rasch model. The results indicate that the 25 items of the CDSE-SF are well fitted to a latent unidimensional structure, as required by Rasch modeling. The response…
Descriptors: Career Choice, Self Efficacy, Measures (Individuals), Psychometrics
Joseph, Corina; Nichol, Esmie Obrin; Janggu, Tamoi; Madi, Nero – International Journal of Sustainability in Higher Education, 2013
Purpose: The purpose of this paper is to examine the level of environmental literacy among business lecturers in Malaysia. Design/methodology/approach: A survey, which involved a combination of newly developed items and items adopted from past studies, was used to collect data from 35 respondents (out of 70). Findings: The overall mean score for…
Descriptors: Environmental Education, Foreign Countries, Business Education Teachers, Social Responsibility
Jin, Ying; Myers, Nicholas D.; Ahn, Soyeon; Penfield, Randall D. – Educational and Psychological Measurement, 2013
The Rasch model, a member of a larger group of models within item response theory, is widely used in empirical studies. Detection of uniform differential item functioning (DIF) within the Rasch model typically employs null hypothesis testing with a concomitant consideration of effect size (e.g., signed area [SA]). Parametric equivalence between…
Descriptors: Test Bias, Effect Size, Item Response Theory, Comparative Analysis
Bowden, Stephen C.; Petrauskas, Vilija M.; Bardenhagen, Fiona J.; Meade, Catherine E.; Simpson, Leonie C. – Assessment, 2013
The Digit Span subtest from the Wechsler Scales is used to measure Freedom from Distractibility or Working Memory. Some published research suggests that Digit Span forward should be interpreted differently from Digit Span backward. The present study explored the dimensionality of the Wechsler Memory Scale-III Digit Span (forward and backward)…
Descriptors: Short Term Memory, Cognitive Tests, Factor Analysis, Correlation
Rahman, Nazia – ProQuest LLC, 2013
Samejima hypothesized that non-monotonically increasing item response functions (IRFs) of ability might occur for multiple-choice items (referred to here as "Samejima items") if low ability test takers with some, though incomplete, knowledge or skill are drawn to a particularly attractive distractor, while very low ability test takers…
Descriptors: Multiple Choice Tests, Test Items, Item Response Theory, Probability
Topczewski, Anna Marie – ProQuest LLC, 2013
Developmental score scales represent the performance of students along a continuum, where as students learn more they move higher along that continuum. Unidimensional item response theory (UIRT) vertical scaling has become a commonly used method to create developmental score scales. Research has shown that UIRT vertical scaling methods can be…
Descriptors: Item Response Theory, Scaling, Scores, Student Development
Herman, Joan; Linn, Robert – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2013
Two consortia, the Smarter Balanced Assessment Consortium (Smarter Balanced) and the Partnership for Assessment of Readiness for College and Careers (PARCC), are currently developing comprehensive, technology-based assessment systems to measure students' attainment of the Common Core State Standards (CCSS). The consequences of the consortia…
Descriptors: Consortia, Student Evaluation, Educational Testing, Academic Standards
Mileff, Milo – Bulgarian Comparative Education Society, 2013
In the present paper and the discussion that follows, the author presents aspects of test construction and a careful description of instructional objectives. Constructing tests involves several stages such as describing language objectives, selecting appropriate test task, devising and assembling test tasks, and devising a scoring system for…
Descriptors: Behavioral Objectives, Test Construction, Norm Referenced Tests, Criterion Referenced Tests
Wiley, Colby P.; Wedeking, Travis; Galindo, Addy M. – Journal of Psychoeducational Assessment, 2013
This article reviews the Conners Early Childhood (Conners EC; Conners, 2009), a behavior and development rating scale intended to assess children in early childhood, specifically defined as ages 2 to 6 years. Using multiple informants across multiple settings, the Conners EC is administered for the purpose of early identification of disorders or…
Descriptors: Test Reviews, Rating Scales, Developmental Delays, Disability Identification

Peer reviewed
Direct link
