數(shù)據(jù)挖掘：英文版實(shí)用機(jī)器學(xué)習(xí)技術(shù)

定　價(jià)：￥58.00

作　者：	（新西蘭）Lan H.Witten，（新西蘭）Eibe Frank著
出版社：	機(jī)械工業(yè)出版社
叢編項(xiàng)：	經(jīng)典原版書庫
標(biāo)　簽：	數(shù)據(jù)庫存儲(chǔ)與管理

購買這本書可以去

ISBN：	9787111172482	出版時(shí)間：	2005-09-01	包裝：	膠版紙
開本：	c24m	頁數(shù)：	524	字?jǐn)?shù)：

內(nèi)容簡介

　　“本書將這門新的學(xué)科用一種非常容易理解的方式呈現(xiàn)給讀者：它既是一本用于培訓(xùn)新一代實(shí)踐者和研究工作者的教科書，同時(shí)對于我這樣需要不斷充電的專業(yè)讀者也極具啟示作用。Witten和Frank熱切追求的是簡單而流暢的解決方案，他們時(shí)刻不忘將所有的概念都建立在具體實(shí)例的基礎(chǔ)之上，促使讀者首先考慮簡單的技術(shù)，如果這些簡單技術(shù)不足以解決問題，再進(jìn)一步考慮更為高級(jí)和成熟的技術(shù)。假如你想分析和理解數(shù)據(jù)，本書以及相關(guān)的Weka工具包將非常有用?！?――摘自微軟研究院圖靈獎(jiǎng)得主Jim Gray所做的前言本書對1999年的初版做了重大的改動(dòng)。雖說核心概念沒有變化，但本書進(jìn)行了更新使其能反映過去5年里的變化，參考文獻(xiàn)幾乎翻了一番。新版的重要部分包括：30個(gè)新的技術(shù)章節(jié)；一個(gè)加強(qiáng)了的具有交互式界面的Weka機(jī)器學(xué)習(xí)工作平臺(tái)；有關(guān)神經(jīng)網(wǎng)絡(luò)的完整信息，一個(gè)有關(guān)貝葉斯網(wǎng)絡(luò)的新節(jié)；等等。本書提供了機(jī)器學(xué)習(xí)概念的完整基礎(chǔ)，此外還針對實(shí)際工作中應(yīng)用相關(guān)工具和技術(shù)提出了一些建議，在本書中你將發(fā)現(xiàn)： ●成功數(shù)據(jù)挖掘技術(shù)的核心算法，包括歷經(jīng)考驗(yàn)的真實(shí)技術(shù)及前沿的方法。 ●轉(zhuǎn)換輸入或輸出以改善性能的方法。 ●可下載的Weka軟件？？一個(gè)用于數(shù)據(jù)挖掘任務(wù)的機(jī)器學(xué)習(xí)算法的集合，包括用于數(shù)據(jù)預(yù)處理、分類、回歸、聚類、關(guān)聯(lián)規(guī)則以及在新的交互式界面上可視化的工具。

作者簡介

　　LanH.Witten新西蘭懷卡托大學(xué)計(jì)算機(jī)科學(xué)系教授，ACM和新西蘭皇家學(xué)會(huì)成員。他曾榮獲2004年國際信息處理研究聯(lián)合會(huì)頒發(fā)的Namur獎(jiǎng)項(xiàng)，這是一個(gè)兩年一度的榮譽(yù)獎(jiǎng)項(xiàng)，用于獎(jiǎng)勵(lì)那些在信息和通信技術(shù)的社會(huì)應(yīng)用方面做出杰現(xiàn)貢獻(xiàn)及具有國際影響的人。他的著作包括《ManagingGigabytes》（1999）、《HowtoBuildaDigitalLibrary》（2003），以及眾多的期刊文章和會(huì)議論文。EibeFrank，新西蘭懷卡托大學(xué)計(jì)算機(jī)科學(xué)系高級(jí)講師。他在機(jī)器學(xué)習(xí)領(lǐng)域發(fā)表了大量的論文，是《MachineLearingJournal》和《JournalofArtificialIntelligenceResearch》的編委之一。同時(shí)他還是許多數(shù)據(jù)挖掘和機(jī)器學(xué)習(xí)學(xué)術(shù)會(huì)議設(shè)計(jì)委員會(huì)的成員。作為Weka機(jī)器學(xué)習(xí)軟件的核心開發(fā)成員之一，他維護(hù)并不斷完善著這個(gè)軟件。相關(guān)圖書軟件過程改進(jìn)（英文版）80X86匯編語言與計(jì)算機(jī)體系結(jié)構(gòu)計(jì)算機(jī)科學(xué)概論（英文版·第2版）分布式系統(tǒng)概念設(shè)計(jì)調(diào)和分析導(dǎo)論（英文第三版）人工智能：智能系統(tǒng)指南（英文版）第二版電力系統(tǒng)分析（英文版·第2版）面向計(jì)算機(jī)科學(xué)的數(shù)理邏輯：系統(tǒng)建模與推理（英文版·第2版）機(jī)器視覺教程（英文版）（含盤）支持向量機(jī)導(dǎo)論（英文版）Java教程（英文版·第2版）軟件需求管理：用例方法（英文版·第2版）數(shù)字通信導(dǎo)論離散事件系統(tǒng)仿真（英文版·第4版）復(fù)雜SoC設(shè)計(jì)（英文版）基于FPGA的系統(tǒng)設(shè)計(jì)（英文版）實(shí)用軟件工程（英文版）UNIX教程（英文版·第2版）軟件測試（英文版第2版）設(shè)計(jì)模式精解（英文版第2版）Linux內(nèi)核編程必讀-經(jīng)典原版書庫實(shí)分析和概率論-經(jīng)典原版書庫（英文版.第2版）計(jì)算機(jī)體系結(jié)構(gòu)：量化研究方法：第3版數(shù)學(xué)規(guī)劃導(dǎo)論英文版抽樣理論與方法（英文版）Java2專家導(dǎo)引（英文版·第3版）復(fù)分析基礎(chǔ)及工程應(yīng)用（英文版.第3版）電子設(shè)計(jì)自動(dòng)化基礎(chǔ)（英文版）Java程序設(shè)計(jì)導(dǎo)論（英文版·第5版）UML參考手冊（第2版）UML參考手冊（英文版·第2版）計(jì)算理論導(dǎo)引計(jì)算機(jī)取證（英文版）EffectiveC#（英文版）基于用例的面向方面軟件開發(fā)（英文版）

圖書目錄

Foreword
Preface
Part I Machine learning tools and techniques
1. What?s it all about?
  1.1 Data mining and machine learning
  1.2 Simple examples: the weather problem and others
  1.3 Fielded applications
  1.4 Machine learning and statistics
  1.5 Generalization as search
  1.6 Data mining and ethics
  1.7 Further reading
2. Input: Concepts, instances, attributes
  2.1 What?s a concept?
  2.2 What?s in an example?
  2.3 What?s in an attribute?
  2.4 Preparing the input
  2.5 Further reading
3. Output: Knowledge representation
  3.1 Decision tables
  3.2 Decision trees
  3.3 Classification rules
  3.4 Association rules
  3.5 Rules with exceptions
  3.6 Rules involving relations
  3.7 Trees for numeric prediction
  3.8 Instance-based representation
  3.9 Clusters
  3.10 Further reading
4. Algorithms: The basic methods
  4.1 Inferring rudimentary rules
  4.2 Statistical modeling
  4.3 Divide-and-conquer: constructing decision trees
  4.4 Covering algorithms: constructing rules
  4.5 Mining association rules
  4.6 Linear models
  4.7 Instance-based learning
  4.8 Clustering
  4.9 Further reading
5. Credibility: Evaluating what?s been learned
  5.1 Training and testing
  5.2 Predicting performance
  5.3 Cross-validation
  5.4 Other estimates
  5.5 Comparing data mining schemes
  5.6 Predicting probabilities
  5.7 Counting the cost
  5.8 Evaluating numeric prediction
  5.9 The minimum description length principle
  5.10 Applying MDL to clustering
  5.11 Further reading
6. Implementations: Real machine learning schemes
  6.1 Decision trees
  6.2 Classification rules
  6.3 Extending linear models
  6.4 Instance-based learning
  6.5 Numeric prediction
  6.6 Clustering
  6.7 Bayesian networks
7. Transformations: Engineering the input and output
  7.1 Attribute selection
  7.2 Discretizing numeric attributes
  7.3 Some useful transformations
  7.4 Automatic data cleansing
  7.5 Combining multiple models
  7.6 Using unlabeled data
  7.7 Further reading
8. Moving on: Extensions and applications
  8.1 Learning from massive datasets
  8.2 Incorporating domain knowledge
  8.3 Text and Web mining
  8.4 Adversarial situations
  8.5 Ubiquitous data mining
  8.6 Further reading
Part II: The Weka machine learning workbench
9. Introduction to Weka
  9.1 What?s in Weka?
  9.2 How do you use it?
  9.3 What else can you do?
  9.4 How do you get it?
10. The Explorer
  10.1 Getting started
  10.2 Exploring the Explorer
  10.3 Filtering algorithms
  10.4 Learning algorithms
  10.5 Meta-learning algorithms
  10.6 Clustering algorithms
  10.7 Association-rule learners
  10.8 Attribute selection
11. The Knowledge Flow interface
  11.1 Getting started
  11.2 Knowledge Flow components
  11.3 Configuring and connecting the components
  11.4 Incremental learning
12. The Experimenter
  12.1 Getting started
  12.2 Simple setup
  12.3 Advanced setup
  12.4 The Analyze panel
  12.5 Distributing processing over several machines
13. The command-line interface
  13.1 Getting started
  13.2 The structure of Weka
  13.3 Command-line options
14. Embedded machine learning
……
15. Writing new learning schemes
References
Index