{"id":36621,"date":"2024-11-01T09:50:03","date_gmt":"2024-11-01T09:50:03","guid":{"rendered":"http:\/\/atmokpo.com\/w\/?p=36621"},"modified":"2024-11-01T11:52:28","modified_gmt":"2024-11-01T11:52:28","slug":"deep-learning-pytorch-course-preprocessing-checking-missing-values","status":"publish","type":"post","link":"https:\/\/atmokpo.com\/w\/36621\/","title":{"rendered":"Deep Learning PyTorch Course, Preprocessing, Checking Missing Values"},"content":{"rendered":"<p><body><\/p>\n<p>The performance of deep learning models heavily depends on the quality of the data. Therefore, data preprocessing is one of the most important processes in building deep learning models. In this course, we will explain how to perform data preprocessing using Pytorch and how to check for missing values in a dataset.<\/p>\n<h2>1. What is Data Preprocessing?<\/h2>\n<p>Data Preprocessing is the process of transforming raw data into a suitable format for analysis. This process can include several stages and typically involves the following tasks.<\/p>\n<ul>\n<li>Handling missing values<\/li>\n<li>Normalization and standardization<\/li>\n<li>Categorical data encoding<\/li>\n<li>Data splitting (train\/validation\/test)<\/li>\n<\/ul>\n<h2>2. Handling Missing Values<\/h2>\n<p>Missing Values refer to the state in which certain values in a dataset are empty. Missing values can negatively impact analysis results, so they need to be handled properly. There are various methods for handling missing values, and some of the representative methods are as follows.<\/p>\n<ul>\n<li>Row removal: A method that deletes rows with missing values<\/li>\n<li>Column removal: A method that deletes columns with missing values<\/li>\n<li>Imputation: A method that replaces missing values with the mean, median, mode, etc.<\/li>\n<\/ul>\n<h2>3. Preprocessing and Checking for Missing Values with Pytorch<\/h2>\n<p>Now, let&#8217;s perform actual data preprocessing and check for missing values using Pytorch. First, we import the necessary libraries.<\/p>\n<pre><code>import numpy as np\nimport pandas as pd\nimport torch\nfrom torch.utils.data import Dataset, DataLoader<\/code><\/pre>\n<h3>3.1 Creating a Dataset<\/h3>\n<p>Let\u2019s create a dataset to use as an example. This dataset contains some missing values.<\/p>\n<pre><code>data = {\n    'feature_1': [1.0, 2.5, np.nan, 4.5, 5.0],\n    'feature_2': [np.nan, 1.5, 2.0, 2.5, 3.0],\n    'label': [0, 1, 0, 1, 0]\n}\n\ndf = pd.DataFrame(data)\nprint(df)<\/code><\/pre>\n<h3>3.2 Checking for Missing Values<\/h3>\n<p>You can check for missing values using Pandas. The <code>isnull()<\/code> method can be used to identify missing values.<\/p>\n<pre><code># Checking for missing values\nmissing_values = df.isnull().sum()\nprint(\"Number of missing values in each column:\\n\", missing_values)<\/code><\/pre>\n<h3>3.3 Handling Missing Values<\/h3>\n<p>Let&#8217;s look at how to handle missing values. Here, we will use the method of replacing missing values with the mean.<\/p>\n<pre><code># Replacing missing values with mean\ndf['feature_1'].fillna(df['feature_1'].mean(), inplace=True)\ndf['feature_2'].fillna(df['feature_2'].mean(), inplace=True)\nprint(\"After replacing missing values:\\n\", df)<\/code><\/pre>\n<h3>4. Converting the Dataset to a Pytorch Dataset<\/h3>\n<p>Once the data preprocessing is complete, we convert the dataset by inheriting from Pytorch&#8217;s <code>Dataset<\/code> class.<\/p>\n<pre><code>class MyDataset(Dataset):\n    def __init__(self, dataframe):\n        self.dataframe = dataframe\n        \n    def __len__(self):\n        return len(self.dataframe)\n\n    def __getitem__(self, idx):\n        return torch.tensor(self.dataframe.iloc[idx, :-1].values, dtype=torch.float32), \\\n               torch.tensor(self.dataframe.iloc[idx, -1], dtype=torch.long)\n\ndataset = MyDataset(df)\ndataloader = DataLoader(dataset, batch_size=2, shuffle=True)<\/code><\/pre>\n<h3>5. Conclusion<\/h3>\n<p>In this course, we learned about the importance of data preprocessing in deep learning and methods to handle missing values. We practiced checking and handling missing values using Pytorch, which helped us learn effective ways to prepare a dataset. Since data preprocessing is a crucial step to enhance the performance of deep learning models, it must be well understood and utilized.<\/p>\n<h2>References<\/h2>\n<p>You can find more information through the following resources:<\/p>\n<ul>\n<li><a href=\"https:\/\/pytorch.org\/docs\/stable\/index.html\">Pytorch Documentation<\/a><\/li>\n<li><a href=\"https:\/\/pandas.pydata.org\/pandas-docs\/stable\/\">Pandas Documentation<\/a><\/li>\n<li><a href=\"https:\/\/scikit-learn.org\/stable\/\">Scikit-learn Documentation<\/a><\/li>\n<\/ul>\n<footer>\n<p>Author: [Your Name] | Date: [Date Written]<\/p>\n<\/footer>\n<p><\/body><\/p>\n","protected":false},"excerpt":{"rendered":"<p>The performance of deep learning models heavily depends on the quality of the data. Therefore, data preprocessing is one of the most important processes in building deep learning models. In this course, we will explain how to perform data preprocessing using Pytorch and how to check for missing values in a dataset. 1. What is &hellip; <a href=\"https:\/\/atmokpo.com\/w\/36621\/\" class=\"more-link\">\ub354 \ubcf4\uae30<span class=\"screen-reader-text\"> &#8220;Deep Learning PyTorch Course, Preprocessing, Checking Missing Values&#8221;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[149],"tags":[],"class_list":["post-36621","post","type-post","status-publish","format-standard","hentry","category-pytorch-study"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.2 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Deep Learning PyTorch Course, Preprocessing, Checking Missing Values - \ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/atmokpo.com\/w\/36621\/\" \/>\n<meta property=\"og:locale\" content=\"ko_KR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Deep Learning PyTorch Course, Preprocessing, Checking Missing Values - \ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8\" \/>\n<meta property=\"og:description\" content=\"The performance of deep learning models heavily depends on the quality of the data. Therefore, data preprocessing is one of the most important processes in building deep learning models. In this course, we will explain how to perform data preprocessing using Pytorch and how to check for missing values in a dataset. 1. What is &hellip; \ub354 \ubcf4\uae30 &quot;Deep Learning PyTorch Course, Preprocessing, Checking Missing Values&quot;\" \/>\n<meta property=\"og:url\" content=\"https:\/\/atmokpo.com\/w\/36621\/\" \/>\n<meta property=\"og:site_name\" content=\"\ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8\" \/>\n<meta property=\"article:published_time\" content=\"2024-11-01T09:50:03+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-11-01T11:52:28+00:00\" \/>\n<meta name=\"author\" content=\"root\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@bebubo4\" \/>\n<meta name=\"twitter:site\" content=\"@bebubo4\" \/>\n<meta name=\"twitter:label1\" content=\"\uae00\uc4f4\uc774\" \/>\n\t<meta name=\"twitter:data1\" content=\"root\" \/>\n\t<meta name=\"twitter:label2\" content=\"\uc608\uc0c1 \ub418\ub294 \ud310\ub3c5 \uc2dc\uac04\" \/>\n\t<meta name=\"twitter:data2\" content=\"3\ubd84\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/atmokpo.com\/w\/36621\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/atmokpo.com\/w\/36621\/\"},\"author\":{\"name\":\"root\",\"@id\":\"https:\/\/atmokpo.com\/w\/#\/schema\/person\/91b6b3b138fbba0efb4ae64b1abd81d7\"},\"headline\":\"Deep Learning PyTorch Course, Preprocessing, Checking Missing Values\",\"datePublished\":\"2024-11-01T09:50:03+00:00\",\"dateModified\":\"2024-11-01T11:52:28+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/atmokpo.com\/w\/36621\/\"},\"wordCount\":387,\"publisher\":{\"@id\":\"https:\/\/atmokpo.com\/w\/#organization\"},\"articleSection\":[\"PyTorch Study\"],\"inLanguage\":\"ko-KR\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/atmokpo.com\/w\/36621\/\",\"url\":\"https:\/\/atmokpo.com\/w\/36621\/\",\"name\":\"Deep Learning PyTorch Course, Preprocessing, Checking Missing Values - \ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8\",\"isPartOf\":{\"@id\":\"https:\/\/atmokpo.com\/w\/#website\"},\"datePublished\":\"2024-11-01T09:50:03+00:00\",\"dateModified\":\"2024-11-01T11:52:28+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/atmokpo.com\/w\/36621\/#breadcrumb\"},\"inLanguage\":\"ko-KR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/atmokpo.com\/w\/36621\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/atmokpo.com\/w\/36621\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"\ud648\",\"item\":\"https:\/\/atmokpo.com\/w\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Deep Learning PyTorch Course, Preprocessing, Checking Missing Values\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/atmokpo.com\/w\/#website\",\"url\":\"https:\/\/atmokpo.com\/w\/\",\"name\":\"\ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/atmokpo.com\/w\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/atmokpo.com\/w\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"ko-KR\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/atmokpo.com\/w\/#organization\",\"name\":\"\ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8\",\"url\":\"https:\/\/atmokpo.com\/w\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ko-KR\",\"@id\":\"https:\/\/atmokpo.com\/w\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/atmokpo.com\/w\/wp-content\/uploads\/2024\/11\/logo.png\",\"contentUrl\":\"https:\/\/atmokpo.com\/w\/wp-content\/uploads\/2024\/11\/logo.png\",\"width\":400,\"height\":400,\"caption\":\"\ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8\"},\"image\":{\"@id\":\"https:\/\/atmokpo.com\/w\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/x.com\/bebubo4\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/atmokpo.com\/w\/#\/schema\/person\/91b6b3b138fbba0efb4ae64b1abd81d7\",\"name\":\"root\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ko-KR\",\"@id\":\"https:\/\/atmokpo.com\/w\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/708197b41fc6435a7ce22d951b25d4a47e9e904270cb1f04682d4f025066f80c?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/708197b41fc6435a7ce22d951b25d4a47e9e904270cb1f04682d4f025066f80c?s=96&d=mm&r=g\",\"caption\":\"root\"},\"sameAs\":[\"http:\/\/atmokpo.com\/w\"],\"url\":\"https:\/\/atmokpo.com\/w\/author\/root\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Deep Learning PyTorch Course, Preprocessing, Checking Missing Values - \ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/atmokpo.com\/w\/36621\/","og_locale":"ko_KR","og_type":"article","og_title":"Deep Learning PyTorch Course, Preprocessing, Checking Missing Values - \ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8","og_description":"The performance of deep learning models heavily depends on the quality of the data. Therefore, data preprocessing is one of the most important processes in building deep learning models. In this course, we will explain how to perform data preprocessing using Pytorch and how to check for missing values in a dataset. 1. What is &hellip; \ub354 \ubcf4\uae30 \"Deep Learning PyTorch Course, Preprocessing, Checking Missing Values\"","og_url":"https:\/\/atmokpo.com\/w\/36621\/","og_site_name":"\ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8","article_published_time":"2024-11-01T09:50:03+00:00","article_modified_time":"2024-11-01T11:52:28+00:00","author":"root","twitter_card":"summary_large_image","twitter_creator":"@bebubo4","twitter_site":"@bebubo4","twitter_misc":{"\uae00\uc4f4\uc774":"root","\uc608\uc0c1 \ub418\ub294 \ud310\ub3c5 \uc2dc\uac04":"3\ubd84"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/atmokpo.com\/w\/36621\/#article","isPartOf":{"@id":"https:\/\/atmokpo.com\/w\/36621\/"},"author":{"name":"root","@id":"https:\/\/atmokpo.com\/w\/#\/schema\/person\/91b6b3b138fbba0efb4ae64b1abd81d7"},"headline":"Deep Learning PyTorch Course, Preprocessing, Checking Missing Values","datePublished":"2024-11-01T09:50:03+00:00","dateModified":"2024-11-01T11:52:28+00:00","mainEntityOfPage":{"@id":"https:\/\/atmokpo.com\/w\/36621\/"},"wordCount":387,"publisher":{"@id":"https:\/\/atmokpo.com\/w\/#organization"},"articleSection":["PyTorch Study"],"inLanguage":"ko-KR"},{"@type":"WebPage","@id":"https:\/\/atmokpo.com\/w\/36621\/","url":"https:\/\/atmokpo.com\/w\/36621\/","name":"Deep Learning PyTorch Course, Preprocessing, Checking Missing Values - \ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8","isPartOf":{"@id":"https:\/\/atmokpo.com\/w\/#website"},"datePublished":"2024-11-01T09:50:03+00:00","dateModified":"2024-11-01T11:52:28+00:00","breadcrumb":{"@id":"https:\/\/atmokpo.com\/w\/36621\/#breadcrumb"},"inLanguage":"ko-KR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/atmokpo.com\/w\/36621\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/atmokpo.com\/w\/36621\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"\ud648","item":"https:\/\/atmokpo.com\/w\/en\/"},{"@type":"ListItem","position":2,"name":"Deep Learning PyTorch Course, Preprocessing, Checking Missing Values"}]},{"@type":"WebSite","@id":"https:\/\/atmokpo.com\/w\/#website","url":"https:\/\/atmokpo.com\/w\/","name":"\ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8","description":"","publisher":{"@id":"https:\/\/atmokpo.com\/w\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/atmokpo.com\/w\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"ko-KR"},{"@type":"Organization","@id":"https:\/\/atmokpo.com\/w\/#organization","name":"\ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8","url":"https:\/\/atmokpo.com\/w\/","logo":{"@type":"ImageObject","inLanguage":"ko-KR","@id":"https:\/\/atmokpo.com\/w\/#\/schema\/logo\/image\/","url":"https:\/\/atmokpo.com\/w\/wp-content\/uploads\/2024\/11\/logo.png","contentUrl":"https:\/\/atmokpo.com\/w\/wp-content\/uploads\/2024\/11\/logo.png","width":400,"height":400,"caption":"\ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8"},"image":{"@id":"https:\/\/atmokpo.com\/w\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/bebubo4"]},{"@type":"Person","@id":"https:\/\/atmokpo.com\/w\/#\/schema\/person\/91b6b3b138fbba0efb4ae64b1abd81d7","name":"root","image":{"@type":"ImageObject","inLanguage":"ko-KR","@id":"https:\/\/atmokpo.com\/w\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/708197b41fc6435a7ce22d951b25d4a47e9e904270cb1f04682d4f025066f80c?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/708197b41fc6435a7ce22d951b25d4a47e9e904270cb1f04682d4f025066f80c?s=96&d=mm&r=g","caption":"root"},"sameAs":["http:\/\/atmokpo.com\/w"],"url":"https:\/\/atmokpo.com\/w\/author\/root\/"}]}},"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/posts\/36621","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/comments?post=36621"}],"version-history":[{"count":1,"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/posts\/36621\/revisions"}],"predecessor-version":[{"id":36622,"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/posts\/36621\/revisions\/36622"}],"wp:attachment":[{"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/media?parent=36621"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/categories?post=36621"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/tags?post=36621"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}