1 00:00:00,527 --> 00:00:03,383 Welcome to How to Learn Data Science. 2 00:00:03,383 --> 00:00:08,137 In this workshop, we'll build a learning path through data science with treehouse. 3 00:00:08,137 --> 00:00:09,191 Let's get started. 4 00:00:10,885 --> 00:00:15,785 Data science is the field of study that combines programming, math, 5 00:00:15,785 --> 00:00:20,364 and domain expertise to extract meaningful insights from data. 6 00:00:20,364 --> 00:00:26,362 The phrase domain expertise refers to having knowledge in a particular field, 7 00:00:26,362 --> 00:00:30,491 for example, medicine, finance, or agriculture. 8 00:00:30,491 --> 00:00:34,910 The AI revolution is changing the way we interact with technology. 9 00:00:34,910 --> 00:00:39,013 Experts believe that as the world trains its focus on creating intelligent 10 00:00:39,013 --> 00:00:39,670 machines. 11 00:00:39,670 --> 00:00:44,074 but ways that humans interact with data science will rapidly evolve. 12 00:00:44,074 --> 00:00:48,463 This means that as a developer, understanding data science can provide you 13 00:00:48,463 --> 00:00:51,286 with an array of possibilities for your career. 14 00:00:51,286 --> 00:00:56,568 It's an extraordinary time to start learning data science, 15 00:00:56,568 --> 00:01:00,029 but you may be asking, where do I start? 16 00:01:00,029 --> 00:01:03,879 Data science can seem overwhelming because of its vastness. 17 00:01:03,879 --> 00:01:08,645 But if you map out a learning path, acquiring the necessary skills is not 18 00:01:08,645 --> 00:01:12,487 a difficult task will present one of many possible paths. 19 00:01:12,487 --> 00:01:15,506 Remember, your learning path is personal to you. 20 00:01:15,506 --> 00:01:19,409 Everyone is unique and takes a different approach to learning. 21 00:01:19,409 --> 00:01:21,572 It doesn't matter where you start. 22 00:01:21,572 --> 00:01:26,367 The important thing is that you actually start the journey. 23 00:01:26,367 --> 00:01:31,413 A smart path to learning data science begins with programming. 24 00:01:31,413 --> 00:01:35,907 Learning programming languages is an essential step in learning data science. 25 00:01:35,907 --> 00:01:40,822 Programming equips data scientists with the tools needed to perform practical 26 00:01:40,822 --> 00:01:42,077 data science work. 27 00:01:42,077 --> 00:01:46,719 Those new to programming can get started with two relatively easy programming 28 00:01:46,719 --> 00:01:47,499 languages. 29 00:01:47,499 --> 00:01:52,184 Python's simplicity and readability makes it a great choice for beginners. 30 00:01:52,184 --> 00:01:57,158 Its rich set of libraries provide data scientists with a powerful toolkit for 31 00:01:57,158 --> 00:02:02,148 performing tasks like data manipulation, analysis, and visualization. 32 00:02:02,148 --> 00:02:05,100 Sequel is a domain-specific language designed for 33 00:02:05,100 --> 00:02:07,050 managing and manipulating data. 34 00:02:07,050 --> 00:02:11,120 Almost all organizations store data in databases, 35 00:02:11,120 --> 00:02:15,289 making SQL a critical language for data scientists. 36 00:02:15,289 --> 00:02:17,861 Starting with these two versatile and 37 00:02:17,861 --> 00:02:23,338 easy-to-learn languages is a great way to begin your journey in data science. 38 00:02:23,338 --> 00:02:27,416 The next step on our path is mathematics. 39 00:02:27,416 --> 00:02:31,433 Math serves as a fundamental pillar in the field of data science, 40 00:02:31,433 --> 00:02:36,341 providing essential tools and concepts that underpin the entire discipline. 41 00:02:36,341 --> 00:02:41,056 Linear Algebra is used in machine learning for tasks such as image processing and 42 00:02:41,056 --> 00:02:42,937 natural language processing. 43 00:02:42,937 --> 00:02:47,751 Key concepts to learn include matrix manipulation and vector spaces. 44 00:02:47,751 --> 00:02:52,088 Calculus is used in data science to optimize machine learning models and 45 00:02:52,088 --> 00:02:54,874 to calculate probabilities and statistics. 46 00:02:54,874 --> 00:02:59,641 Calculus involves two main concepts, derivatives and integrals. 47 00:02:59,641 --> 00:03:02,856 Start by understanding these two concepts. 48 00:03:02,856 --> 00:03:06,663 Probability and statistics are used to analyze data and 49 00:03:06,663 --> 00:03:09,262 to make decisions based on that data. 50 00:03:09,262 --> 00:03:12,898 Learn the fundamental concepts of mean, median, 51 00:03:12,898 --> 00:03:16,373 and mode, variance, and standard deviation. 52 00:03:16,373 --> 00:03:21,686 These disciplines make up the foundation of many data science techniques and 53 00:03:21,686 --> 00:03:22,672 algorithms. 54 00:03:22,672 --> 00:03:27,361 Without a strong mathematical foundation, it's difficult to fully 55 00:03:27,361 --> 00:03:32,777 understand how these techniques work and to apply them in real world scenarios. 56 00:03:32,777 --> 00:03:37,066 Machine learning is the next stop on our path. 57 00:03:37,066 --> 00:03:41,472 Machine learning is a subset of artificial intelligence that focuses on 58 00:03:41,472 --> 00:03:43,134 developing algorithms and 59 00:03:43,134 --> 00:03:48,494 models that enable computers to learn from data without being explicitly programmed. 60 00:03:48,494 --> 00:03:53,478 Start by learning the basic concepts of machine learning, such as unsupervised and 61 00:03:53,478 --> 00:03:57,070 supervised learning, clustering, and neural networks. 62 00:03:57,070 --> 00:04:00,905 Machine learning algorithms have a backbone of any machine learning 63 00:04:00,905 --> 00:04:01,771 application. 64 00:04:01,771 --> 00:04:06,580 They're used to analyze data, identify patterns, and make predictions. 65 00:04:06,580 --> 00:04:12,208 By using algorithms like linear and logistic regression, and decision trees. 66 00:04:12,208 --> 00:04:15,201 You can quickly identify patterns and trends and 67 00:04:15,201 --> 00:04:18,127 data that might not be visible to the naked eye. 68 00:04:18,127 --> 00:04:22,453 Then gain experience of machine learning tools and libraries. 69 00:04:22,453 --> 00:04:27,263 A few popular options are scikit-learn, a Python library that includes 70 00:04:27,263 --> 00:04:32,705 well-designed tools for performing common machine learning tasks and PyTorch. 71 00:04:32,705 --> 00:04:37,137 A Python based machine learning framework that simplifies the development and 72 00:04:37,137 --> 00:04:41,117 training of deep learning models with its ease of use and flexibility. 73 00:04:41,117 --> 00:04:44,261 Making it a preferred choice for researchers and 74 00:04:44,261 --> 00:04:48,029 practitioners in the field of Artificial Intelligence. 75 00:04:48,029 --> 00:04:51,935 Machine learning can be leveraged to predict customer churn, 76 00:04:51,935 --> 00:04:55,177 forecast demand for a product and identify fraud and 77 00:04:55,177 --> 00:04:58,211 financial transactions in all of these fields. 78 00:04:58,211 --> 00:05:03,726 And many more professionals with the skills are in high demand. 79 00:05:03,726 --> 00:05:09,460 Data foundations is a good next step in learning data science. 80 00:05:09,460 --> 00:05:13,026 Working with data is a fundamental skill in data science, 81 00:05:13,026 --> 00:05:17,978 allowing data scientists to extract insights from data, identify patterns and 82 00:05:17,978 --> 00:05:21,279 relationships, and represent data in visual form. 83 00:05:21,279 --> 00:05:26,158 Data scientists spend significant time preparing data before analyzing it. 84 00:05:26,158 --> 00:05:30,258 And this is where data wrangling and manipulation become critical. 85 00:05:30,258 --> 00:05:33,745 This involves techniques like cleaning, filtering, and 86 00:05:33,745 --> 00:05:38,140 sorting and transforming raw data that's often messy and unstructured, 87 00:05:38,140 --> 00:05:41,712 which can affect the accuracy and validity of the analysis. 88 00:05:41,712 --> 00:05:46,587 Once the data has been transformed into a suitable format, it can be analyzed to 89 00:05:46,587 --> 00:05:51,127 extract insights and identify patterns, trends, and relationships. 90 00:05:51,127 --> 00:05:55,769 Data visualization is the process of representing data in a visual form. 91 00:05:55,769 --> 00:06:00,434 As a beginner, the easiest way to learn data visualization is to start with 92 00:06:00,434 --> 00:06:03,480 basic charts and graphs using tools like Excel. 93 00:06:03,480 --> 00:06:08,023 A popular spreadsheet program that includes basic data visualization 94 00:06:08,023 --> 00:06:08,785 features. 95 00:06:08,785 --> 00:06:13,696 And Tableau, tool that allows users to connect to various data sources, 96 00:06:13,696 --> 00:06:18,223 create interactive dashboards, and share insights with others. 97 00:06:18,223 --> 00:06:23,388 Additionally, it can be helpful to learn some basic principles of visual design, 98 00:06:23,388 --> 00:06:26,504 such as color theory, typography, and layout. 99 00:06:26,504 --> 00:06:31,098 By understanding these principles, you can create visualizations that 100 00:06:31,098 --> 00:06:35,851 are both aesthetically pleasing and effective at conveying information. 101 00:06:35,851 --> 00:06:40,692 Having a solid understanding of the methods and tools used to analyze and 102 00:06:40,692 --> 00:06:46,261 derive insights from data will set you up for success in your data science journey. 103 00:06:46,261 --> 00:06:50,900 Gaining domain knowledge is an important step on the path to 104 00:06:50,900 --> 00:06:52,952 a career in data science. 105 00:06:52,952 --> 00:06:57,638 Domain knowledge is important when studying data science because it provides 106 00:06:57,638 --> 00:07:02,397 context of a specific industry or field, but the data science work applies to. 107 00:07:02,397 --> 00:07:06,392 Here are four ways to gain experience in a particular domain. 108 00:07:06,392 --> 00:07:10,348 Identify the specific domain or industry you want to learn about. 109 00:07:10,348 --> 00:07:13,674 This will help you focus your efforts and resources. 110 00:07:13,674 --> 00:07:17,442 Working in the industry is the best way to gain domain knowledge. 111 00:07:17,442 --> 00:07:20,397 Look for internships, entry level jobs, or 112 00:07:20,397 --> 00:07:23,915 volunteer opportunities in your field of interest. 113 00:07:23,915 --> 00:07:27,647 Read industry publications, blogs, and newsletters. 114 00:07:27,647 --> 00:07:30,904 This will help you stay up to date with the latest trends and 115 00:07:30,904 --> 00:07:32,777 developments in your industry. 116 00:07:32,777 --> 00:07:36,230 Attend conferences and events relevant to your industry. 117 00:07:36,230 --> 00:07:40,787 Look for opportunities to network with other professionals and 118 00:07:40,787 --> 00:07:43,241 learn from experts in your field. 119 00:07:43,241 --> 00:07:47,379 Without domain knowledge, a data scientists may not understand the nuances 120 00:07:47,379 --> 00:07:50,187 and intricacies of the data they're working with. 121 00:07:50,187 --> 00:07:53,859 Leading to inaccurate or incomplete analysis and conclusions. 122 00:07:56,462 --> 00:08:00,870 And there you have it, a smart learning path for you to follow. 123 00:08:00,870 --> 00:08:05,343 Many people have built their data science skills along the same path we just 124 00:08:05,343 --> 00:08:09,190 explored and launched new careers in data science on their own. 125 00:08:09,190 --> 00:08:11,942 If a career in data science sounds right for you, 126 00:08:11,942 --> 00:08:14,909 you can start building your skills with treehouse. 127 00:08:14,909 --> 00:08:18,233 The demand for skilled professionals is growing every day. 128 00:08:18,233 --> 00:08:22,821 And tech companies around the world are removing formal degree requirements for 129 00:08:22,821 --> 00:08:26,137 tech positions to access a bigger pool of job candidates. 130 00:08:27,803 --> 00:08:30,122 What does this mean for you? 131 00:08:30,122 --> 00:08:35,054 You can start a career in data science right here at treehouse, 132 00:08:35,054 --> 00:08:36,832 no diploma required. 133 00:08:36,832 --> 00:08:41,521 In addition to individual courses perfect for beginning coders, treehouse offers for 134 00:08:41,521 --> 00:08:42,669 Python Techdegree. 135 00:08:42,669 --> 00:08:46,996 A self paced interactive bootcamp where you will learn to build apps and 136 00:08:46,996 --> 00:08:51,045 work with data while building a portfolio and getting certified. 137 00:08:51,045 --> 00:08:53,877 The Techdegrees project-based curriculum and 138 00:08:53,877 --> 00:08:58,581 incredible student support is a great place to start your data science journey. 139 00:08:58,581 --> 00:09:02,227 We also offer the beginning Python track, a guided curriculum 140 00:09:02,227 --> 00:09:06,849 that includes 14 hours of courses and workshops, starting with the basics. 141 00:09:06,849 --> 00:09:11,598 Tracks cover all relevant courses and workshops necessary to master a subject. 142 00:09:11,598 --> 00:09:15,776 So you'll be coding with Python in no time. 143 00:09:15,776 --> 00:09:19,026 To learn the math needed to work in data science, 144 00:09:19,026 --> 00:09:23,166 consider starting with basic statistics for data analysis. 145 00:09:23,166 --> 00:09:26,877 This course teaches students how to calculate basic statistics, 146 00:09:26,877 --> 00:09:29,470 solidify understanding of the terminology. 147 00:09:29,470 --> 00:09:34,890 And determine which graphs might be most useful in displaying data. 148 00:09:34,890 --> 00:09:39,421 In our machine learning basics course you can explore the fundamental concepts in 149 00:09:39,421 --> 00:09:40,543 machine learning. 150 00:09:40,543 --> 00:09:45,201 For instance supervised versus unsupervised learning as well as write 151 00:09:45,201 --> 00:09:46,761 a bit of code in Python. 152 00:09:46,761 --> 00:09:49,811 So that you can make intelligent predictions. 153 00:09:49,811 --> 00:09:55,866 To establish a solid foundation in all things data, check out our library for 154 00:09:55,866 --> 00:10:00,061 many courses and workshops covering data cleaning, 155 00:10:00,061 --> 00:10:03,619 visualization, analysis, and much more. 156 00:10:03,619 --> 00:10:08,521 Better yet, dive into the data analysis Techdegree where you'll learn how to 157 00:10:08,521 --> 00:10:12,607 use data to answer questions, gather actionable insights, and 158 00:10:12,607 --> 00:10:16,412 tell a story using spreadsheets, databases, and Python. 159 00:10:16,412 --> 00:10:20,580 You'll graduate with a data analysis certification through a credible and 160 00:10:20,580 --> 00:10:24,880 a well-rounded portfolio of projects that demonstrates your newly acquired 161 00:10:24,880 --> 00:10:25,610 expertise. 162 00:10:28,168 --> 00:10:33,064 The industries of the future will be built on the foundations of digital technology, 163 00:10:33,064 --> 00:10:35,787 artificial intelligence, and data science. 164 00:10:35,787 --> 00:10:39,121 These fields are rapidly expanding and the demand for 165 00:10:39,121 --> 00:10:42,018 skilled professionals is growing every day. 166 00:10:42,018 --> 00:10:46,935 Organizations everywhere are actively searching for data science experts to help 167 00:10:46,935 --> 00:10:50,879 them improve their products, tools, and business operations. 168 00:10:50,879 --> 00:10:52,587 By learning data science, 169 00:10:52,587 --> 00:10:57,129 you can increase your chances of being hired in a rapidly growing field. 170 00:10:57,129 --> 00:11:02,131 Learning data science can be a game changer for your career. 171 00:11:02,131 --> 00:11:05,863 We encourage you to take the next step. 172 00:11:05,863 --> 00:11:12,462 Whatever data science path you choose, remember, treehouse is here to help. 173 00:11:12,462 --> 00:11:13,270 Thanks for watching.