Machine learning : (Record no. 60295)

000 -LEADER
fixed length control field 14591nam a22003737a 4500
001 - CONTROL NUMBER
control field 18238705
003 - CONTROL NUMBER IDENTIFIER
control field CITU
005 - DATE AND TIME OF LATEST TRANSACTION
control field 20230216171454.0
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION
fixed length control field cr an aaaaa
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 140723s2015 inu 000 0 eng
010 ## - LIBRARY OF CONGRESS CONTROL NUMBER
LC control number 2014946682
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9781118889398
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9781119183464 (ebook)
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9781119642183
040 ## - CATALOGING SOURCE
Original cataloging agency DLC
Language of cataloging eng
Description conventions rda
Transcribing agency DLC
Modifying agency DLC
041 ## - LANGUAGE CODE
Language code of text/sound track or separate title eng.
050 00 - LIBRARY OF CONGRESS CALL NUMBER
Classification number MLCM 2018/46448 (Q)
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number 006.31
100 1# - MAIN ENTRY--PERSONAL NAME
Preferred name for the person Bell, Jason,
Relator term author.
245 10 - TITLE STATEMENT
Title Machine learning :
Remainder of title hands-on for developers and technical professionals /
Statement of responsibility, etc Jason Bell.
264 #1 - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT)
Place of publication, distribution, etc Indianapolis, Indiana :
Name of publisher, distributor, etc Wiley,
Date of publication, distribution, etc [2015]
300 ## - PHYSICAL DESCRIPTION
Extent 1 online resource (xxiv, 380 pages) :
Other physical details illustrations
336 ## - CONTENT TYPE
Content type term text
Content type code txt
Source rdacontent
337 ## - MEDIA TYPE
Media type term computer
Media type code n
Source rdamedia
338 ## - CARRIER TYPE
Carrier type term online resource
Carrier type code nc
Source rdacarrier
500 ## - GENERAL NOTE
General note Author Bios<br/>Jason Bell has been working with point of sale and customer loyalty data since 2002 and has been involved in software development for more than 25 years. He works as a senior technical architect, lecturer and also advises startups that are just beginning their technical adventures.
505 ## - CONTENTS
Formatted contents note TABLE OF CONTENTS<br/>Introduction xix<br/><br/>Chapter 1 What Is Machine Learning? 1<br/><br/>History of Machine Learning 1<br/><br/>Alan Turing 1<br/><br/>Arthur Samuel 2<br/><br/>Tom M. Mitchell 2<br/><br/>Summary Definition 2<br/><br/>Algorithm Types for Machine Learning 3<br/><br/>Supervised Learning 3<br/><br/>Unsupervised Learning 3<br/><br/>The Human Touch 4<br/><br/>Uses for Machine Learning 4<br/><br/>Software 4<br/><br/>Stock Trading 5<br/><br/>Robotics 6<br/><br/>Medicine and Healthcare 6<br/><br/>Advertising 6<br/><br/>Retail and E-Commerce 7<br/><br/>Gaming Analytics 8<br/><br/>The Internet of Things 9<br/><br/>Languages for Machine Learning 10<br/><br/>Python 10<br/><br/>R 10<br/><br/>Matlab 10<br/><br/>Scala 10<br/><br/>Clojure 11<br/><br/>Ruby 11<br/><br/>Software Used in This Book 11<br/><br/>Checking the Java Version 11<br/><br/>Weka Toolkit 12<br/><br/>Mahout 12<br/><br/>SpringXD 13<br/><br/>Hadoop 13<br/><br/>Using an IDE 14<br/><br/>Data Repositories 14<br/><br/>UC Irvine Machine Learning Repository 14<br/><br/>Infochimps 14<br/><br/>Kaggle 15<br/><br/>Summary 15<br/><br/>Chapter 2 Planning for Machine Learning 17<br/><br/>The Machine Learning Cycle 17<br/><br/>It All Starts with a Question 18<br/><br/>I Don’t Have Data! 19<br/><br/>Starting Local 19<br/><br/>Competitions 19<br/><br/>One Solution Fits All? 20<br/><br/>Defining the Process 20<br/><br/>Planning 20<br/><br/>Developing 21<br/><br/>Testing 21<br/><br/>Reporting 21<br/><br/>Refining 22<br/><br/>Production 22<br/><br/>Building a Data Team 22<br/><br/>Mathematics and Statistics 22<br/><br/>Programming 23<br/><br/>Graphic Design 23<br/><br/>Domain Knowledge 23<br/><br/>Data Processing 23<br/><br/>Using Your Computer 24<br/><br/>A Cluster of Machines 24<br/><br/>Cloud-Based Services 24<br/><br/>Data Storage 25<br/><br/>Physical Discs 25<br/><br/>Cloud-Based Storage 25<br/><br/>Data Privacy 25<br/><br/>Cultural Norms 25<br/><br/>Generational Expectations 26<br/><br/>The Anonymity of User Data 26<br/><br/>Don’t Cross “The Creepy Line” 27<br/><br/>Data Quality and Cleaning 28<br/><br/>Presence Checks 28<br/><br/>Type Checks 29<br/><br/>Length Checks 29<br/><br/>Range Checks 30<br/><br/>Format Checks 30<br/><br/>The Britney Dilemma 30<br/><br/>What’s in a Country Name? 33<br/><br/>Dates and Times 35<br/><br/>Final Thoughts on Data Cleaning 35<br/><br/>Thinking about Input Data 36<br/><br/>Raw Text 36<br/><br/>Comma Separated Variables 36<br/><br/>JSON 37<br/><br/>YAML 39<br/><br/>XML 39<br/><br/>Spreadsheets 40<br/><br/>Databases 41<br/><br/>Thinking about Output Data 42<br/><br/>Don’t Be Afraid to Experiment 42<br/><br/>Summary 43<br/><br/>Chapter 3 Working with Decision Trees 45<br/><br/>The Basics of Decision Trees 45<br/><br/>Uses for Decision Trees 45<br/><br/>Advantages of Decision Trees 46<br/><br/>Limitations of Decision Trees 46<br/><br/>Different Algorithm Types 47<br/><br/>How Decision Trees Work 48<br/><br/>Decision Trees in Weka 53<br/><br/>The Requirement 53<br/><br/>Training Data 53<br/><br/>Using Weka to Create a Decision Tree 55<br/><br/>Creating Java Code from the Classifi cation 60<br/><br/>Testing the Classifi er Code 64<br/><br/>Thinking about Future Iterations 66<br/><br/>Summary 67<br/><br/>Chapter 4 Bayesian Networks 69<br/><br/>Pilots to Paperclips 69<br/><br/>A Little Graph Theory 70<br/><br/>A Little Probability Theory 72<br/><br/>Coin Flips 72<br/><br/>Conditional Probability 72<br/><br/>Winning the Lottery 73<br/><br/>Bayes’ Theorem 73<br/><br/>How Bayesian Networks Work 75<br/><br/>Assigning Probabilities 76<br/><br/>Calculating Results 77<br/><br/>Node Counts 78<br/><br/>Using Domain Experts 78<br/><br/>A Bayesian Network Walkthrough 79<br/><br/>Java APIs for Bayesian Networks 79<br/><br/>Planning the Network 79<br/><br/>Coding Up the Network 81<br/><br/>Summary 90<br/><br/>Chapter 5 Artificial Neural Networks 91<br/><br/>What Is a Neural Network? 91<br/><br/>Artificial Neural Network Uses 92<br/><br/>High-Frequency Trading 92<br/><br/>Credit Applications 93<br/><br/>Data Center Management 93<br/><br/>Robotics 93<br/><br/>Medical Monitoring 93<br/><br/>Breaking Down the Artifi cial Neural Network 94<br/><br/>Perceptrons 94<br/><br/>Activation Functions 95<br/><br/>Multilayer Perceptrons 96<br/><br/>Back Propagation 98<br/><br/>Data Preparation for Artifi cial Neural Networks 99<br/><br/>Artificial Neural Networks with Weka 100<br/><br/>Generating a Dataset 100<br/><br/>Loading the Data into Weka 102<br/><br/>Configuring the Multilayer Perceptron 103<br/><br/>Training the Network 105<br/><br/>Altering the Network 108<br/><br/>Increasing the Test Data Size 108<br/><br/>Implementing a Neural Network in Java 109<br/><br/>Create the Project 109<br/><br/>The Code 111<br/><br/>Converting from CSV to Arff 114<br/><br/>Running the Neural Network 114<br/><br/>Summary 115<br/><br/>Chapter 6 Association Rules Learning 117<br/><br/>Where Is Association Rules Learning Used? 117<br/><br/>Web Usage Mining 118<br/><br/>Beer and Diapers 118<br/><br/>How Association Rules Learning Works 119<br/><br/>Support 121<br/><br/>Confidence 121<br/><br/>Lift 122<br/><br/>Conviction 122<br/><br/>Defining the Process 122<br/><br/>Algorithms 123<br/><br/>Apriori 123<br/><br/>FP-Growth 124<br/><br/>Mining the Baskets—A Walkthrough 124<br/><br/>Downloading the Raw Data 124<br/><br/>Setting Up the Project in Eclipse 125<br/><br/>Setting Up the Items Data File 126<br/><br/>Setting Up the Data 129<br/><br/>Running Mahout 131<br/><br/>Inspecting the Results 133<br/><br/>Putting It All Together 135<br/><br/>Further Development 136<br/><br/>Summary 137<br/><br/>Chapter 7 Support Vector Machines 139<br/><br/>What Is a Support Vector Machine? 139<br/><br/>Where Are Support Vector Machines Used? 140<br/><br/>The Basic Classifi cation Principles 140<br/><br/>Binary and Multiclass Classifi cation 140<br/><br/>Linear Classifi ers 142<br/><br/>Confidence 143<br/><br/>Maximizing and Minimizing to Find the Line 143<br/><br/>How Support Vector Machines Approach Classifi cation 144<br/><br/>Using Linear Classifi cation 144<br/><br/>Using Non-Linear Classifi cation 146<br/><br/>Using Support Vector Machines in Weka 147<br/><br/>Installing LibSVM 147<br/><br/>A Classification Walkthrough 148<br/><br/>Implementing LibSVM with Java 154<br/><br/>Summary 159<br/><br/>Chapter 8 Clustering 161<br/><br/>What Is Clustering? 161<br/><br/>Where Is Clustering Used? 162<br/><br/>The Internet 162<br/><br/>Business and Retail 163<br/><br/>Law Enforcement 163<br/><br/>Computing 163<br/><br/>Clustering Models 164<br/><br/>How the K-Means Works 164<br/><br/>Calculating the Number of Clusters in a Dataset 166<br/><br/>K-Means Clustering with Weka 168<br/><br/>Preparing the Data 168<br/><br/>The Workbench Method 169<br/><br/>The Command-Line Method 174<br/><br/>The Coded Method 178<br/><br/>Summary 186<br/><br/>Chapter 9 Machine Learning in Real Time with Spring XD 187<br/><br/>Capturing the Firehose of Data 187<br/><br/>Considerations of Using Data in Real Time 188<br/><br/>Potential Uses for a Real-Time System 188<br/><br/>Using Spring XD 189<br/><br/>Spring XD Streams 190<br/><br/>Input Sources, Sinks, and Processors 190<br/><br/>Learning from Twitter Data 193<br/><br/>The Development Plan 193<br/><br/>Configuring the Twitter API Developer Application 194<br/><br/>Configuring Spring XD 196<br/><br/>Starting the Spring XD Server 197<br/><br/>Creating Sample Data 198<br/><br/>The Spring XD Shell 198<br/><br/>Streams 101 199<br/><br/>Spring XD and Twitter 202<br/><br/>Setting the Twitter Credentials 202<br/><br/>Creating Your First Twitter Stream 203<br/><br/>Where to Go from Here 205<br/><br/>Introducing Processors 206<br/><br/>How Processors Work within a Stream 206<br/><br/>Creating Your Own Processor 207<br/><br/>Real-Time Sentiment Analysis 215<br/><br/>How the Basic Analysis Works 215<br/><br/>Creating a Sentiment Processor 217<br/><br/>Spring XD Taps 221<br/><br/>Summary 222<br/><br/>Chapter 10 Machine Learning as a Batch Process 223<br/><br/>Is It Big Data? 223<br/><br/>Considerations for Batch Processing Data 224<br/><br/>Volume and Frequency 224<br/><br/>How Much Data? 225<br/><br/>Which Process Method? 225<br/><br/>Practical Examples of Batch Processes 225<br/><br/>Hadoop 225<br/><br/>Sqoop 226<br/><br/>Pig 226<br/><br/>Mahout 226<br/><br/>Cloud-Based Elastic Map Reduce 226<br/><br/>A Note about the Walkthroughs 227<br/><br/>Using the Hadoop Framework 227<br/><br/>The Hadoop Architecture 227<br/><br/>Setting Up a Single-Node Cluster 229<br/><br/>How MapReduce Works 233<br/><br/>Mining the Hashtags 234<br/><br/>Hadoop Support in Spring XD 235<br/><br/>Objectives for This Walkthrough 235<br/><br/>What’s a Hashtag? 235<br/><br/>Creating the MapReduce Classes 236<br/><br/>Performing ETL on Existing Data 247<br/><br/>Product Recommendation with Mahout 250<br/><br/>Mining Sales Data 256<br/><br/>Welcome to My Coffee Shop! 257<br/><br/>Going Small Scale 258<br/><br/>Writing the Core Methods 258<br/><br/>Using Hadoop and MapReduce 260<br/><br/>Using Pig to Mine Sales Data 263<br/><br/>Scheduling Batch Jobs 273<br/><br/>Summary 274<br/><br/>Chapter 11 Apache Spark 275<br/><br/>Spark: A Hadoop Replacement? 275<br/><br/>Java, Scala, or Python? 276<br/><br/>Scala Crash Course 276<br/><br/>Installing Scala 276<br/><br/>Packages 277<br/><br/>Data Types 277<br/><br/>Classes 278<br/><br/>Calling Functions 278<br/><br/>Operators 279<br/><br/>Control Structures 279<br/><br/>Downloading and Installing Spark 280<br/><br/>A Quick Intro to Spark 280<br/><br/>Starting the Shell 281<br/><br/>Data Sources 282<br/><br/>Testing Spark 282<br/><br/>Spark Monitor 284<br/><br/>Comparing Hadoop MapReduce to Spark 285<br/><br/>Writing Standalone Programs with Spark 288<br/><br/>Spark Programs in Scala 288<br/><br/>Installing SBT 288<br/><br/>Spark Programs in Java 291<br/><br/>Spark Program Summary 295<br/><br/>Spark SQL 295<br/><br/>Basic Concepts 295<br/><br/>Using SparkSQL with RDDs 296<br/><br/>Spark Streaming 305<br/><br/>Basic Concepts 305<br/><br/>Creating Your First Stream with Scala 306<br/><br/>Creating Your First Stream with Java 309<br/><br/>MLib: The Machine Learning Library 311<br/><br/>Dependencies 311<br/><br/>Decision Trees 312<br/><br/>Clustering 313<br/><br/>Summary 313<br/><br/>Chapter 12 Machine Learning with R 315<br/><br/>Installing R 315<br/><br/>Mac OSX 315<br/><br/>Windows 316<br/><br/>Linux 316<br/><br/>Your First Run 316<br/><br/>Installing R-Studio 317<br/><br/>The R Basics 318<br/><br/>Variables and Vectors 318<br/><br/>Matrices 319<br/><br/>Lists 320<br/><br/>Data Frames 321<br/><br/>Installing Packages 322<br/><br/>Loading in Data 323<br/><br/>Plotting Data 324<br/><br/>Simple Statistics 327<br/><br/>Simple Linear Regression 329<br/><br/>Creating the Data 329<br/><br/>The Initial Graph 329<br/><br/>Regression with the Linear Model 330<br/><br/>Making a Prediction 331<br/><br/>Basic Sentiment Analysis 331<br/><br/>Functions to Load in Word Lists 331<br/><br/>Writing a Function to Score Sentiment 332<br/><br/>Testing the Function 333<br/><br/>Apriori Association Rules 333<br/><br/>Installing the ARules Package 334<br/><br/>The Training Data 334<br/><br/>Importing the Transaction Data 335<br/><br/>Running the Apriori Algorithm 336<br/><br/>Inspecting the Results 336<br/><br/>Accessing R from Java 337<br/><br/>Installing the rJava Package 337<br/><br/>Your First Java Code in R 337<br/><br/>Calling R from Java Programs 338<br/><br/>Setting Up an Eclipse Project 338<br/><br/>Creating the Java/R Class 339<br/><br/>Running the Example 340<br/><br/>Extending Your R Implementations 342<br/><br/>R and Hadoop 342<br/><br/>The RHadoop Project 342<br/><br/>A Sample Map Reduce Job in RHadoop 343<br/><br/>Connecting to Social Media with R 345<br/><br/>Summary 347<br/><br/>Appendix A SpringXD Quick Start 349<br/><br/>Installing Manually 349<br/><br/>Starting SpringXD 349<br/><br/>Creating a Stream 350<br/><br/>Adding a Twitter Application Key 350<br/><br/>Appendix B Hadoop 1.x Quick Start 351<br/><br/>Downloading and Installing Hadoop 351<br/><br/>Formatting the HDFS Filesystem 352<br/><br/>Starting and Stopping Hadoop 353<br/><br/>Process List of a Basic Job 353<br/><br/>Appendix C Useful Unix Commands 355<br/><br/>Using Sample Data 355<br/><br/>Showing the Contents: cat, more, and less 356<br/><br/>Example Command 356<br/><br/>Expected Output 356<br/><br/>Filtering Content: grep 357<br/><br/>Example Command for Finding Text 357<br/><br/>Example Output 357<br/><br/>Sorting Data: sort 358<br/><br/>Example Command for Basic Sorting 358<br/><br/>Example Output 358<br/><br/>Finding Unique Occurrences: uniq 360<br/><br/>Showing the Top of a File: head 361<br/><br/>Counting Words: wc 361<br/><br/>Locating Anything: fi nd 362<br/><br/>Combining Commands and Redirecting Output 363<br/><br/>Picking a Text Editor 363<br/><br/>Colon Frenzy: Vi and Vim 363<br/><br/>Nano 364<br/><br/>Emacs 364<br/><br/>Appendix D Further Reading 367<br/><br/>Machine Learning 367<br/><br/>Statistics 368<br/><br/>Big Data and Data Science 368<br/><br/>Hadoop 368<br/><br/>Visualization 369<br/><br/>Making Decisions 369<br/><br/>Datasets 369<br/><br/>Blogs 370<br/><br/>Useful Websites 370<br/><br/>The Tools of the Trade 370<br/><br/>Index 373
520 ## - SUMMARY, ETC.
Summary, etc Dig deep into the data with a hands-on guide to machine learning<br/>Machine Learning: Hands-On for Developers and Technical Professionals provides hands-on instruction and fully-coded working examples for the most common machine learning techniques used by developers and technical professionals. The book contains a breakdown of each ML variant, explaining how it works and how it is used within certain industries, allowing readers to incorporate the presented techniques into their own work as they follow along. A core tenant of machine learning is a strong focus on data preparation, and a full exploration of the various types of learning algorithms illustrates how the proper tools can help any developer extract information and insights from existing data. The book includes a full complement of Instructor's Materials to facilitate use in the classroom, making this resource useful for students and as a professional reference.<br/><br/>At its core, machine learning is a mathematical, algorithm-based technology that forms the basis of historical data mining and modern big data science. Scientific analysis of big data requires a working knowledge of machine learning, which forms predictions based on known properties learned from training data. Machine Learning is an accessible, comprehensive guide for the non-mathematician, providing clear guidance that allows readers to:<br/><br/>Learn the languages of machine learning including Hadoop, Mahout, and Weka<br/>Understand decision trees, Bayesian networks, and artificial neural networks<br/>Implement Association Rule, Real Time, and Batch learning<br/>Develop a strategic plan for safe, effective, and efficient machine learning<br/>By learning to construct a system that can learn from data, readers can increase their utility across industries. Machine learning sits at the core of deep dive data analysis and visualization, which is increasingly in demand as companies discover the goldmine hiding in their existing data. For the tech professional involved in data science, Machine Learning: Hands-On for Developers and Technical Professionals provides the skills and techniques required to dig deeper.
526 ## - STUDY PROGRAM INFORMATION NOTE
-- 000-099
-- 006
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Machine learning.
655 #4 - INDEX TERM--GENRE/FORM
Genre/form data or focus term Electronic books.
856 ## - ELECTRONIC LOCATION AND ACCESS
Link text Full text available at Wiley Online Library Click here to view
Uniform Resource Identifier https://onlinelibrary.wiley.com/doi/book/10.1002/9781119183464
942 ## - ADDED ENTRY ELEMENTS
Source of classification or shelving scheme
Item type EBOOK
Holdings
Withdrawn status Lost status Source of classification or shelving scheme Damaged status Not for loan Permanent Location Current Location Shelving location Date acquired Inventory number Full call number Barcode Date last seen Price effective from Item type
          COLLEGE LIBRARY COLLEGE LIBRARY LIC Gateway 2021-03-27 50313 006.31 B4131 2015 CL-50313 2021-03-27 2021-03-27 EBOOK