Linking Errors between Two Populations and Tests: A Case Study in International Surveys in Education
Hastedt, Dirk; Desa, Deana – Practical Assessment, Research & Evaluation, 2015
This simulation study was prompted by the current increased interest in linking national studies to international large-scale assessments (ILSAs) such as IEA's TIMSS, IEA's PIRLS, and OECD's PISA. Linkage in this scenario is achieved by including items from the international assessments in the national assessments on the premise that the average…
Descriptors: Case Studies, Simulation, International Programs, Testing Programs
Hansen, Mark; Cai, Li; Monroe, Scott; Li, Zhen – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2014
It is a well-known problem in testing the fit of models to multinomial data that the full underlying contingency table will inevitably be sparse for tests of reasonable length and for realistic sample sizes. Under such conditions, full-information test statistics such as Pearson's X[superscript 2] and the likelihood ratio statistic G[superscript…
Descriptors: Goodness of Fit, Item Response Theory, Classification, Maximum Likelihood Statistics
Jiang, Feng; McComas, William F. – International Journal of Science Education, 2015
Gauging the effectiveness of specific teaching strategies remains a major topic of interest in science education. Inquiry teaching among others has been supported by extensive research and recommended by the National Science Education Standards. However, most of the empirical evidence in support was collected in research settings rather than in…
Descriptors: Inquiry, Active Learning, Science Instruction, Science Achievement
Simui, Francis; Chibale, Henry; Namangala, Boniface – Open Praxis, 2017
This paper focuses on the management of distance education examination in a poorly resourced North-Eastern region of Zambia. The study applies a Hermeneutic Phenomenology approach to generate and make sense of the data. The study draws its insights from the lived experiences of 2 invigilators and 66 purposively selected students. Meaning…
Descriptors: Distance Education, Phenomenology, Testing Programs, Testing
Golan, Shari; Woodbridge, Michelle; Davies-Mercier, Betsy; Pistorino, Carol – Office of Planning, Evaluation and Policy Development, US Department of Education, 2016
States increasingly are incorporating Kindergarten Entry Assessments (KEAs) into their comprehensive assessment systems with the goal of helping educators identify gaps in children's competencies, target instruction to children's individual needs, engage parents to better support their child's learning, and identify needs for expanding and…
Descriptors: Kindergarten, Tests, Testing Programs, Data Use
Yee, Mary – Teachers College Record, 2015
This study constitutes the secondary analysis of data collected as part of classroom instruction in a prior practitioner inquiry study. Consequently, IRB approval, parental consent, and participant assent for the present study were obtained after the conclusion of the original study.
Descriptors: English Language Learners, Classroom Techniques, Inquiry, Educational Legislation
Wang, Ze – Educational Psychology, 2015
Using data from the Trends in International Mathematics and Science Study (TIMSS) 2007, this study examined the big-fish-little-pond-effects (BFLPEs) in 49 countries. In this study, the effect of math ability on math self-concept was decomposed into within- and between-level components using implicit mean centring and the complex data…
Descriptors: Nonverbal Ability, Mathematics, Self Concept, Hierarchical Linear Modeling
Chen, Qian – International Journal of Science and Mathematics Education, 2014
In this study, the Trends in International Mathematics and Science Study 2007 data were used to build mathematics achievement models of fourth graders in two East Asian school systems: Hong Kong and Singapore. In each school system, eight variables at student level and nine variables at school/class level were incorporated to build an achievement…
Descriptors: Foreign Countries, Mathematics Achievement, Grade 4, Mathematics Tests
Brame, Cynthia J.; Biel, Rachel – CBE - Life Sciences Education, 2015
Testing within the science classroom is commonly used for both formative and summative assessment purposes to let the student and the instructor gauge progress toward learning goals. Research within cognitive science suggests, however, that testing can also be a learning event. We present summaries of studies that suggest that repeated retrieval…
Descriptors: Undergraduate Students, Testing Programs, Feedback (Response), Test Format
Livingston, Samuel A. – ETS Research Report Series, 2014
In this study, I investigated 2 procedures intended to create test-taker groups of equal ability by poststratifying on a composite variable created from demographic information. In one procedure, the stratifying variable was the composite variable that best predicted the test score. In the other procedure, the stratifying variable was the…
Descriptors: Demography, Equated Scores, Cluster Grouping, Ability Grouping
Yang, Ji Seung; Cai, Li – Journal of Educational and Behavioral Statistics, 2014
The main purpose of this study is to improve estimation efficiency in obtaining maximum marginal likelihood estimates of contextual effects in the framework of a nonlinear multilevel latent variable model by adopting the Metropolis-Hastings Robbins-Monro algorithm (MH-RM). Results indicate that the MH-RM algorithm can produce estimates and standard…
Descriptors: Computation, Hierarchical Linear Modeling, Mathematics, Context Effect
Yang, Ji Seung; Cai, Li – Grantee Submission, 2014
The main purpose of this study is to improve estimation efficiency in obtaining maximum marginal likelihood estimates of contextual effects in the framework of a nonlinear multilevel latent variable model by adopting the Metropolis-Hastings Robbins-Monro algorithm (MH-RM; Cai, 2008, 2010a, 2010b). Results indicate that the MH-RM algorithm can…
Descriptors: Computation, Hierarchical Linear Modeling, Mathematics, Context Effect
Alonzo, Alicia C.; Ke, Li – Measurement: Interdisciplinary Research and Perspectives, 2016
A new vision of science learning described in the "Next Generation Science Standards"--particularly the science and engineering practices and their integration with content--poses significant challenges for large-scale assessment. This article explores what might be learned from advances in large-scale science assessment and…
Descriptors: Science Achievement, Science Tests, Group Testing, Accountability
Wasserberg, Martin J.; Rottman, Amy – American Secondary Education, 2016
The purpose of this study was to examine African American and Latino student perceptions on test-centered curricular protocols in the urban high school context. Data collection occurred through observations, classroom dialogue initiated by the researchers, and individual student interviews throughout an academic semester. Findings suggest that…
Descriptors: High School Students, Student Attitudes, Urban Schools, African American Students
Hansen, Mark; Cai, Li; Monroe, Scott; Li, Zhen – Grantee Submission, 2016
Despite the growing popularity of diagnostic classification models (e.g., Rupp, Templin, & Henson, 2010) in educational and psychological measurement, methods for testing their absolute goodness-of-fit to real data remain relatively underdeveloped. For tests of reasonable length and for realistic sample size, full-information test statistics…
Descriptors: Goodness of Fit, Item Response Theory, Classification, Maximum Likelihood Statistics

