Publication Date
In 2025 | 2 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 9 |
Since 2016 (last 10 years) | 295 |
Since 2006 (last 20 years) | 635 |
Descriptor
Statistical Analysis | 822 |
Tests | 822 |
Foreign Countries | 307 |
Scores | 238 |
Comparative Analysis | 204 |
Correlation | 167 |
Teaching Methods | 149 |
Questionnaires | 138 |
Student Attitudes | 128 |
Academic Achievement | 127 |
College Students | 125 |
More ▼ |
Source
Author
Publication Type
Education Level
Location
Turkey | 55 |
China | 14 |
Germany | 14 |
Australia | 11 |
California | 11 |
North Carolina | 11 |
Taiwan | 11 |
Canada | 10 |
Spain | 10 |
United Kingdom | 10 |
Indonesia | 9 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 4 |
American Recovery and… | 1 |
Elementary and Secondary… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |
Ke-Hai Yuan; Zhiyong Zhang – Grantee Submission, 2025
Most methods for structural equation modeling (SEM) focused on the analysis of covariance matrices. However, "Historically, interesting psychological theories have been phrased in terms of correlation coefficients." This might be because data in social and behavioral sciences typically do not have predefined metrics. While proper methods…
Descriptors: Correlation, Statistical Analysis, Models, Tests
San Martín, Ernesto; González, Jorge – Journal of Educational and Behavioral Statistics, 2022
The nonequivalent groups with anchor test (NEAT) design is widely used in test equating. Under this design, two groups of examinees are administered different test forms with each test form containing a subset of common items. Because test takers from different groups are assigned only one test form, missing score data emerge by design rendering…
Descriptors: Tests, Scores, Statistical Analysis, Models
Njål Foldnes; Jonas Moss; Steffen Grønneberg – Structural Equation Modeling: A Multidisciplinary Journal, 2025
We propose new ways of robustifying goodness-of-fit tests for structural equation modeling under non-normality. These test statistics have limit distributions characterized by eigenvalues whose estimates are highly unstable and biased in known directions. To take this into account, we design model-based trend predictions to approximate the…
Descriptors: Goodness of Fit, Structural Equation Models, Robustness (Statistics), Prediction
Paul T. von Hippel – Education Next, 2024
In a 1984 essay, Benjamin Bloom, an educational psychologist at the University of Chicago, asserted that tutoring offered "the best learning conditions we can devise" and that tutors could raise student achievement by two full standard deviations--or, in statistical parlance, two "sigmas." The influence of Bloom's two-sigma…
Descriptors: Tutoring, Academic Achievement, Educational Experiments, Tests
Ayva Yörü, Fatma Gökçen; Atar, Hakan Yavuz – Journal of Pedagogical Research, 2019
The aim of this study is to examine whether the items in the mathematics subtest of the Centralized High School Entrance Placement Test [HSEPT] administered in 2012 by the Ministry of National Education in Turkey show DIF according to gender and type of school. For this purpose, SIBTEST, Breslow-Day, Lord's [chi-squared] and Raju's area…
Descriptors: Test Bias, Mathematics Tests, Test Items, Gender Differences
Karun Adusumilli; Francesco Agostinelli; Emilio Borghesan – National Bureau of Economic Research, 2024
This paper examines the scalability of the results from the Tennessee Student-Teacher Achievement Ratio (STAR) Project, a prominent educational experiment. We explore how the misalignment between the experimental design and the econometric model affects researchers' ability to learn about the intervention's scalability. We document heterogeneity…
Descriptors: Class Size, Research Design, Educational Research, Program Effectiveness
Raykov, Tenko; Marcoulides, George A.; Dimitrov, Dimiter M.; Li, Tatyana – Educational and Psychological Measurement, 2018
This article extends the procedure outlined in the article by Raykov, Marcoulides, and Tong for testing congruence of latent constructs to the setting of binary items and clustering effects. In this widely used setting in contemporary educational and psychological research, the method can be used to examine if two or more homogeneous…
Descriptors: Tests, Psychometrics, Test Items, Construct Validity
Ilgun Dibek, Munevver – International Journal of Educational Methodology, 2021
Response times are one of the important sources that provide information about the performance of individuals during a test process. The main purpose of this study is to show that survival models can be used in educational data. Accordingly, data sets of items measuring literacy, numeracy and problem-solving skills of the countries participating…
Descriptors: Reaction Time, Test Items, Adults, Foreign Countries
Li, Feifei – ETS Research Report Series, 2017
An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…
Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement
Walstad, William B.; Rebeck, Ken – Journal of Economic Education, 2017
The "Test of Financial Literacy" (TFL) was created to measure the financial knowledge of high school students. Its content is based on the standards and benchmarks stated in the "National Standards for Financial Literacy" (Council for Economic Education 2013). The test development process involved extensive item writing and…
Descriptors: Tests, Money Management, Literacy, High School Students
Resolving Dimensionality in a Child Assessment Tool: An Application of the Multilevel Bifactor Model
Akaeze, Hope O.; Lawrence, Frank R.; Wu, Jamie Heng-Chieh – Educational and Psychological Measurement, 2023
Multidimensionality and hierarchical data structure are common in assessment data. These design features, if not accounted for, can threaten the validity of the results and inferences generated from factor analysis, a method frequently employed to assess test dimensionality. In this article, we describe and demonstrate the application of the…
Descriptors: Measures (Individuals), Multidimensional Scaling, Tests, Hierarchical Linear Modeling
Baytemir, Kemal; Ilhan, Tahsin – Electronic Journal of Research in Educational Psychology, 2018
Introduction: The aim of this study is to develop a measurement instrument for measuring the exam anxiety experienced by the parents regarding their children's exams. Method: The data were collected from two different study groups. While the first group consists of 299 parents, 169 female and 130 male, the second group consists of 200 parents, 108…
Descriptors: Test Construction, Anxiety, Parents, Test Validity
Repass, Jim T. – ProQuest LLC, 2017
Relieving test anxiety actions range from relaxation exercises to prescription medication. Humor can be a simple method of test anxiety relief. The current study was used to determine if humor, in the form of a cartoon, placed on the splash page of an online exam improved the test scores of students who have high test anxiety. In the current…
Descriptors: Test Anxiety, Statistical Analysis, Humor, Quasiexperimental Design
Brady, Shannon T.; Hard, Bridgette Martin; Gross, James J. – Journal of Educational Psychology, 2018
The idea that test anxiety hurts performance is deeply ingrained in American culture and schools. However, researchers have found that it is actually worry about performance and anxiety--not bodily feelings of anxiety (emotionality)--that impairs performance. Drawing on this insight, anxiety reappraisal interventions encourage the view that…
Descriptors: Test Anxiety, Academic Achievement, College Freshmen, Intervention
Förster, Manuel; Happ, Roland; Molerov, Dimitar – Journal of Economic Education, 2017
In this article, the authors present the adaptation and validation processes conducted to render the American "Test of Financial Literacy" (TFL) suitable for use in Germany (TFL-G). First, they outline the translation procedure followed and the various cultural adjustments made in line with international standards. Next, they present…
Descriptors: Money Management, Tests, Scores, Test Content