注冊 | 登錄讀書好,好讀書,讀好書!
讀書網(wǎng)-DuShu.com
當前位置: 首頁出版圖書科學技術計算機/網(wǎng)絡數(shù)據(jù)庫數(shù)據(jù)庫設計/管理Python數(shù)據(jù)分析(影印版)

Python數(shù)據(jù)分析(影印版)

Python數(shù)據(jù)分析(影印版)

定 價:¥68.00

作 者: (印尼)伊德里斯(Idris,I.)著
出版社: 東南大學出版社
叢編項:
標 簽: 計算機/網(wǎng)絡 數(shù)據(jù)倉庫與數(shù)據(jù)挖掘 數(shù)據(jù)庫

購買這本書可以去


ISBN: 9787564160647 出版時間: 2016-01-01 包裝: 平裝
開本: 16開 頁數(shù): 字數(shù):  

內(nèi)容簡介

  Python是一種多范式的編程語言,既適合面向對象的應用開發(fā),也適合函數(shù)式設計模式。Python已然成為數(shù)據(jù)科學家們在數(shù)據(jù)分析、可視化和機器學習方面的**語言,它可以帶來高效率和高生產(chǎn)力。伊德里斯所*的《Python數(shù)據(jù)分析(影印版)(英文版)》將教會初學者如何發(fā)掘Python的*大潛力用于數(shù)據(jù)分析,包括從數(shù)據(jù)獲取、清洗、操作、可視化以及存儲到復分析和建模等一切相關主題。它聚焦于一系列開源Python模塊,比如NumPy、SciPy、matplotlib、pandas、IPython、Cython、scikit-learn以及NLTK等。在后面的章節(jié)里,本書涵蓋了數(shù)據(jù)可視化、信號處理與時間序列分析、數(shù)據(jù)庫、可預測分析及機器學習等主題。該書可以讓你分分鐘變成**數(shù)據(jù)分析師。

作者簡介

暫缺《Python數(shù)據(jù)分析(影印版)》作者簡介

圖書目錄

Preface
Chapter 1: Getting Started with Python Libraries
  Software used in this book
    Installing software and setup
    On Windows
    On Linux
    On Mac OS X
  Building NumPy SciPy, matplotlib, and IPython from source
  Installing with setuptools
  NumPy arrays
  A simple application
  Using IPython as a shell
  Reading manual pages
  IPython notebooks
  Where to find help and references
  Summary
Chapter 2: NumPy Arrays
  The NumPy array object
    The advantages of NumPy arrays
  Creating a multidimensional array
  Selecting NumPy array elements
  NumPy numerical types
    Data type objects
    Character codes
    The dtype constructors
    The dtype attributes
  One-dimensional slicing and indexing
  Manipulating array shapes
    Stacking arrays
    Splitting NumPy arrays
    NumPy array attributes
    Converting arrays
  Creating array views and copies
  Fancy indexing
  Indexing with a list of locations
  Indexing NumPy arrays with Booleans
  Broadcasting NumPy arrays
  Summary
Chapter 3: Statistics and Linear Algebra
  NumPy and SciPy modules
  Basic descriptive statistics with NumPy
  Linear algebra with NumPy
    Inverting matrices with NumPy,
    Solving linear systems with NumPy
  Finding eigenvalues and eigenvectors with-NumPy
  NumPy random numbers
    Gambling with the binomial distribution
    Sampling the normal distribution
    Performing a normality test with SciPy
  Creating a NumPy-masked array
    Disregarding negative and extreme values
  Summary
Chapter 4: pandas Primer
  Installing and exploring pandas
  pandas DataFrames
  pandas Series
  Querying data in pandas
  Statistics with pandas DataFrames
  Data aggregation with pandas DataFrames
  Concatenating and appending DataFrames
  Joining DataFrames
  Handling missing values
  Dealing with dates
  Pivot tables
  Remote data access
  Summary
Chapter 5: Retrieving, Processing, and Storing Data
  Writing CSV files withNumPy and pandas
  Comparing the NumPy .npy binary format and pickling
  pandas DataFrames
  Storing data with PyTables
  Reading and writing pandas DataFrames to HDF5 stores
  Reading and writing to Excel with pandas
  Using REST web services and JSON
  Reading and writing JSON with pandas
  Parsing RSS and Atom feeds
  Parsing HTML with Beautiful Soup
  Summary
Chapter 6: Data Visualization
  matplotlib subpackages
  Basic matplotlib plots
  Logarithmic plots
  Scatter plots
  Legends and annotations
  Three-dimensional plots
  Plotting in pandas
  Lag plots
  Autocorrelation plots
  Plot.ly
  Summary
Chapter 7: Signal Processing and Time Series
  statsmodels subpackages
  Moving averages
  Window functions
  Defining cointegration
  Autocorrelation
  Autoregressive models
  ARMA models
  Generating periodic signals
  Fourier analysis
  Spectral analysis
  Filtering
  Summary
Chapter 8: Working with Databases
  Lightweight access with sqlite3
  Accessing databases from pandas
  SQLAIchemy
    Installing and setting up SQLAIchemy
    Populating a database with SQLAIchemy
    Querying the database with SQLAIchemy
  Pony ORM
  Dataset - databases for lazy people
  PyMongo and MongoDB
  Storing data in Redis
  Apache Cassandra
  Summary
Chapter 9: Analyzing Textual Data and Social Media
  Installing NLTK
  Filtering out stopwords, names, and numbers
  The bag-of-words model
  Analyzing word frequencies
  Naive Bayes classification
  Sentiment analysis
  Creating word clouds
  Social network analysis
  Summary
Chapter 10: Predictive Analytics and Machine Learning
  A tour of scikit-learn
  Preprocessing
  Classification with logistic regression
  Classification with support vector machines
  Regression with ElasticNetCV
  Support vector regression
  Clustering with affinity propagation
  Mean Shift
  Genetic algorithms
  Neural networks
  Decision trees
  Summary
Chapter 11: Environments Outside the Python Ecosystem and Cloud Computing
  Exchanging information with MATLAB/Octave
  Installing rpy2
  Interfacing with R
  Sending NumPy arrays to Java
  Integrating SWIG and NumPy
  Integrating Boost and Python
  Using Fortran code through f2py
  Setting up Google App Engine
  Running programs on PythonAnywhere
  Working with Wakari
  Summary
Chapter 12: Performance Tuning, Profiling, and Concurrency
  Profiling the code
  Installing Cython
  Calling C code
  Creating a process pool with multiprocessing
  Speeding up embarrassingly parallel for loops with Joblib
  Comparing Bottleneck to NumPy functions
  Performing MapReduce with Jug
  Installing MPI for Python
  IPython Parallel
  Summary
Appendix A: Key Concepts
Appendix B: Useful Functions
  matplotlib
  NumPy
  pandas
  Scikit-learn
  SciPy
    scipy.fftpack
    scipy.signal
    scipy.stats
Appendix C: Online Resources
Index

本目錄推薦

掃描二維碼
Copyright ? 讀書網(wǎng) m.ranfinancial.com 2005-2020, All Rights Reserved.
鄂ICP備15019699號 鄂公網(wǎng)安備 42010302001612號