Sharing data and models in software engineering-西安交通大学图书馆

Sharing data and models in software engineering

发布日期:2016-06-06 浏览次

[内容简介]

Data Science for Software Engineering: Sharing Data and Models presents guidance and procedures for reusing data and models between projects to produce results that are useful and relevant. Starting with a background section of practical lessons and warnings for beginner data scientists for software engineering, this edited volume proceeds to identify critical questions of contemporary software engineering related to data and models. Learn how to adapt data from other organizations to local problems, mine privatized data, prune spurious information, simplify complex results, how to update models for new platforms, and more. Chapters share largely applicable experimental results discussed with the blend of practitioner focused domain expertise, with commentary that highlights the methods that are most useful, and applicable to the widest range of projects. Each chapter is written by a prominent expert and offers a state-of-the-art solution to an identified problem facing data scientists in software engineering. Throughout, the editors share best practices collected from their experience training software engineering students and practitioners to master data science, and highlight the methods that are most useful, and applicable to the widest range of projects.

Shares the specific experience of leading researchers and techniques developed to handle data problems in the realm of software engineering
Explains how to start a project of data science for software engineering as well as how to identify and avoid likely pitfalls
Provides a wide range of useful qualitative and quantitative principles ranging from very simple to cutting edge research
Addresses current challenges with software engineering data such as lack of local data, access issues due to data privacy, increasing data quality via cleaning of spurious chunks in data

[目录]

Introduction
Data Science 101
Cross company data: Friend or Foe?
Pruning: Relevancy is the Removal of Irrelevancy
Easy Path: Smarter Design
Instance Weighting: How not to elaborate on analogies
Privacy: Data in Disguise
Stability: How to find a silver-bullet model?
Complexity: How to ensemble multiple models?

上一条：Anonymous communication networks : protecting privacy on the web
下一条：Joe Celko's SQL for smarties : advanced SQL programming

【关闭】