| PISA (Programme for International Student Assessment) is an international survey and comparison of student academic achievement organized and implemented by the Organisation for Economic Co-operation and Development (OECD). Because it draws on advanced educational measurement theory and a mature mode of operation, and because of its high comparability, credibility, and effectiveness, PISA is widely recognized in education worldwide. Scientific literacy assessment is one of the main domains of PISA and closely resembles the Performance Tests for Junior Secondary School Graduates (hereinafter referred to as Performance Tests) in its assessment philosophy, respondents, and assessment methods, so studying the test items of the Performance Tests from the perspective of the PISA science framework is worthwhile. The author analyzes the characteristics and the construction quality of the science test items of the Performance Tests in Ningbo from the perspective of the PISA science framework and puts forward specific proposals. In this research, the author mainly analyzes the content of the PISA 2015 science framework and three science examples in it, and summarizes the features of PISA 2015 science assessment test items: test construction is based on context; assessment content is broad; assessment carriers are simulation-based; evaluation criteria are diversified; and the process of test construction is standardized. The author then analyzes the probable problems in the development of the science test items of the Performance Tests in Ningbo in terms of assessment context, evaluation content, evaluation carriers, test construction, question types, and scoring standards, and makes several recommendations to address these problems. 
The author finds that the main problems in the development of the science test items of the Performance Tests in Ningbo are as follows: the use of context lacks an assessment model, so there are no context selection criteria to serve as a basis for developing test items; the development of test items relies mainly on the personal experience of the examination paper setter, with insufficient use of test development technology, which may affect the reliability and validity of the test items; the assessment method is the paper-and-pencil test, with no use of computer technology; and, when developing scoring standards, setters mainly consider the distribution and weighting of knowledge points and take exam results into account rather than measuring students' ability levels. The author makes the following recommendations to address these problems: view the Performance Tests as a system in order to improve the quality of test construction; establish a context framework for test construction to improve the quality of situational test items; refine test item development technology to improve the reliability and validity of the science test items of the Performance Tests in Ningbo; and apply new assessment technologies and methods to improve the effectiveness of evaluation. |