{"id":36627,"date":"2024-11-01T09:50:06","date_gmt":"2024-11-01T09:50:06","guid":{"rendered":"http:\/\/atmokpo.com\/w\/?p=36627"},"modified":"2024-11-01T11:52:27","modified_gmt":"2024-11-01T11:52:27","slug":"deep-learning-pytorch-course-preprocessing-normalization","status":"publish","type":"post","link":"https:\/\/atmokpo.com\/w\/36627\/","title":{"rendered":"Deep Learning PyTorch Course, Preprocessing, Normalization"},"content":{"rendered":"<p><body><\/p>\n<p>To effectively train deep learning models, the quality of data is extremely important. Therefore, data preprocessing and normalization are essential processes in deep learning tasks. In this article, we will introduce the importance of data preprocessing and normalization techniques, and explain how to prepare and process exposed data using PyTorch with practical examples.<\/p>\n<h2>Table of Contents<\/h2>\n<ul>\n<li><a href=\"#preprocessing\">1. What is Data Preprocessing?<\/a><\/li>\n<li><a href=\"#normalization\">2. What is Normalization?<\/a><\/li>\n<li><a href=\"#why\">3. Why are Preprocessing and Normalization Necessary?<\/a><\/li>\n<li><a href=\"#pytorch_preprocessing\">4. Data Preprocessing in PyTorch<\/a><\/li>\n<li><a href=\"#pytorch_normalization\">5. Normalization in PyTorch<\/a><\/li>\n<li><a href=\"#conclusion\">6. Conclusion<\/a><\/li>\n<\/ul>\n<h2 id=\"preprocessing\">1. What is Data Preprocessing?<\/h2>\n<p>Data preprocessing refers to the process of transforming and cleaning data before inputting it into machine learning or deep learning models. This process ensures the consistency, integrity, and quality of the data. The preprocessing phase includes tasks such as handling missing values, removing outliers, encoding categorical variables, normalizing data, and feature selection.<\/p>\n<h3>1.1 Handling Missing Values<\/h3>\n<p>Missing values can especially cause issues in data analysis. Let\u2019s explore how to detect and handle missing values using the <code>pandas<\/code> library in Python.<\/p>\n<pre><code>import pandas as pd\n\n# Load data\ndata = pd.read_csv('data.csv')\n\n# Check for missing values\nprint(data.isnull().sum())\n\n# Remove missing values\ndata_cleaned = data.dropna()\n# Or replace missing values with the mean\ndata.fillna(data.mean(), inplace=True)<\/code><\/pre>\n<h3>1.2 Detecting and Removing Outliers<\/h3>\n<p>Outliers can negatively impact the training of a model. There are several methods to detect and remove outliers; here, we will show an example using the IQR method.<\/p>\n<pre><code>Q1 = data['column_name'].quantile(0.25)\nQ3 = data['column_name'].quantile(0.75)\nIQR = Q3 - Q1\n\n# Detect outliers using IQR\noutliers = data[(data['column_name'] &lt; (Q1 - 1.5 * IQR)) | (data['column_name'] &gt; (Q3 + 1.5 * IQR))]\ndata_no_outliers = data[~data.index.isin(outliers.index)]<\/code><\/pre>\n<h2 id=\"normalization\">2. What is Normalization?<\/h2>\n<p>Normalization is the process of transforming values of data with different ranges into a consistent range. This can improve the convergence speed of the model and reduce the impact of specific features on the model. Min-Max normalization and Z-score normalization are commonly used methods.<\/p>\n<h3>2.1 Min-Max Normalization<\/h3>\n<p>Min-Max normalization transforms the values of each feature to a scale between 0 and 1. This method follows the formula:<\/p>\n<pre><code>X' = (X - X_min) \/ (X_max - X_min)<\/code><\/pre>\n<h3>2.2 Z-score Normalization<\/h3>\n<p>Z-score normalization transforms the values of each feature so that they have a mean of 0 and a standard deviation of 1. This method follows the formula:<\/p>\n<pre><code>X' = (X - \u03bc) \/ \u03c3<\/code><\/pre>\n<p>Here, <code>\u03bc<\/code> is the mean and <code>\u03c3<\/code> is the standard deviation.<\/p>\n<h2 id=\"why\">3. Why are Preprocessing and Normalization Necessary?<\/h2>\n<p>The processes of data preprocessing and normalization are essential for maximizing the performance of models. This is because:<\/p>\n<ul>\n<li>If there are missing values or outliers, the generalization performance of the model may decrease.<\/li>\n<li>Data that is not normalized can slow down the training speed and cause convergence issues in optimization algorithms.<\/li>\n<li>Features with different ranges can lead the model to overestimate or underestimate specific features.<\/li>\n<\/ul>\n<h2 id=\"pytorch_preprocessing\">4. Data Preprocessing in PyTorch<\/h2>\n<p>In PyTorch, images can be preprocessed using <code>torchvision.transforms<\/code>. Generally, the following transformations are applied when loading a dataset.<\/p>\n<pre><code>import torchvision.transforms as transforms\nfrom torchvision import datasets\n\ntransform = transforms.Compose([\n    transforms.Resize((256, 256)),\n    transforms.ToTensor(),\n    transforms.Normalize(mean=[0.5, 0.5, 0.5], std=[0.5, 0.5, 0.5]),\n])\n\n# Load dataset\ntrain_dataset = datasets.CIFAR10(root='.\/data', train=True, download=True, transform=transform)\ntrain_loader = torch.utils.data.DataLoader(dataset=train_dataset, batch_size=64, shuffle=True)<\/code><\/pre>\n<h2 id=\"pytorch_normalization\">5. Normalization in PyTorch<\/h2>\n<p>PyTorch provides predefined normalization layers to easily perform image normalization. Here\u2019s how to normalize image data.<\/p>\n<pre><code>import torch\nimport torchvision.transforms as transforms\n\n# Define normalization transformation\nnormalize = transforms.Normalize(mean=[0.5, 0.5, 0.5], std=[0.5, 0.5, 0.5])\n\n# Sample image tensor\nimage = torch.randn(3, 256, 256)  # (number of channels, height, width)\n\n# Apply normalization\nnormalized_image = normalize(image)<\/code><\/pre>\n<h2 id=\"conclusion\">6. Conclusion<\/h2>\n<p>The performance of deep learning models heavily depends on the quality of data. Preprocessing and normalization are essential steps in preparing data for effective learning by the model. By utilizing PyTorch, we can effectively carry out these preprocessing and normalization tasks. Through this tutorial, we have understood the necessity of data preprocessing and normalization, and learned how to implement them in PyTorch with actual code examples. In future deep learning projects, we should always pay attention to the data preprocessing and normalization processes.<\/p>\n<footer>\n<p>I hope this article helps you in your deep learning studies.<\/p>\n<\/footer>\n<p><\/body><\/p>\n","protected":false},"excerpt":{"rendered":"<p>To effectively train deep learning models, the quality of data is extremely important. Therefore, data preprocessing and normalization are essential processes in deep learning tasks. In this article, we will introduce the importance of data preprocessing and normalization techniques, and explain how to prepare and process exposed data using PyTorch with practical examples. Table of &hellip; <a href=\"https:\/\/atmokpo.com\/w\/36627\/\" class=\"more-link\">\ub354 \ubcf4\uae30<span class=\"screen-reader-text\"> &#8220;Deep Learning PyTorch Course, Preprocessing, Normalization&#8221;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[149],"tags":[],"class_list":["post-36627","post","type-post","status-publish","format-standard","hentry","category-pytorch-study"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.2 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Deep Learning PyTorch Course, Preprocessing, Normalization - \ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/atmokpo.com\/w\/36627\/\" \/>\n<meta property=\"og:locale\" content=\"ko_KR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Deep Learning PyTorch Course, Preprocessing, Normalization - \ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8\" \/>\n<meta property=\"og:description\" content=\"To effectively train deep learning models, the quality of data is extremely important. Therefore, data preprocessing and normalization are essential processes in deep learning tasks. In this article, we will introduce the importance of data preprocessing and normalization techniques, and explain how to prepare and process exposed data using PyTorch with practical examples. Table of &hellip; \ub354 \ubcf4\uae30 &quot;Deep Learning PyTorch Course, Preprocessing, Normalization&quot;\" \/>\n<meta property=\"og:url\" content=\"https:\/\/atmokpo.com\/w\/36627\/\" \/>\n<meta property=\"og:site_name\" content=\"\ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8\" \/>\n<meta property=\"article:published_time\" content=\"2024-11-01T09:50:06+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-11-01T11:52:27+00:00\" \/>\n<meta name=\"author\" content=\"root\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@bebubo4\" \/>\n<meta name=\"twitter:site\" content=\"@bebubo4\" \/>\n<meta name=\"twitter:label1\" content=\"\uae00\uc4f4\uc774\" \/>\n\t<meta name=\"twitter:data1\" content=\"root\" \/>\n\t<meta name=\"twitter:label2\" content=\"\uc608\uc0c1 \ub418\ub294 \ud310\ub3c5 \uc2dc\uac04\" \/>\n\t<meta name=\"twitter:data2\" content=\"3\ubd84\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/atmokpo.com\/w\/36627\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/atmokpo.com\/w\/36627\/\"},\"author\":{\"name\":\"root\",\"@id\":\"https:\/\/atmokpo.com\/w\/#\/schema\/person\/91b6b3b138fbba0efb4ae64b1abd81d7\"},\"headline\":\"Deep Learning PyTorch Course, Preprocessing, Normalization\",\"datePublished\":\"2024-11-01T09:50:06+00:00\",\"dateModified\":\"2024-11-01T11:52:27+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/atmokpo.com\/w\/36627\/\"},\"wordCount\":508,\"publisher\":{\"@id\":\"https:\/\/atmokpo.com\/w\/#organization\"},\"articleSection\":[\"PyTorch Study\"],\"inLanguage\":\"ko-KR\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/atmokpo.com\/w\/36627\/\",\"url\":\"https:\/\/atmokpo.com\/w\/36627\/\",\"name\":\"Deep Learning PyTorch Course, Preprocessing, Normalization - \ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8\",\"isPartOf\":{\"@id\":\"https:\/\/atmokpo.com\/w\/#website\"},\"datePublished\":\"2024-11-01T09:50:06+00:00\",\"dateModified\":\"2024-11-01T11:52:27+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/atmokpo.com\/w\/36627\/#breadcrumb\"},\"inLanguage\":\"ko-KR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/atmokpo.com\/w\/36627\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/atmokpo.com\/w\/36627\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"\ud648\",\"item\":\"https:\/\/atmokpo.com\/w\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Deep Learning PyTorch Course, Preprocessing, Normalization\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/atmokpo.com\/w\/#website\",\"url\":\"https:\/\/atmokpo.com\/w\/\",\"name\":\"\ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/atmokpo.com\/w\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/atmokpo.com\/w\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"ko-KR\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/atmokpo.com\/w\/#organization\",\"name\":\"\ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8\",\"url\":\"https:\/\/atmokpo.com\/w\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ko-KR\",\"@id\":\"https:\/\/atmokpo.com\/w\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/atmokpo.com\/w\/wp-content\/uploads\/2024\/11\/logo.png\",\"contentUrl\":\"https:\/\/atmokpo.com\/w\/wp-content\/uploads\/2024\/11\/logo.png\",\"width\":400,\"height\":400,\"caption\":\"\ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8\"},\"image\":{\"@id\":\"https:\/\/atmokpo.com\/w\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/x.com\/bebubo4\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/atmokpo.com\/w\/#\/schema\/person\/91b6b3b138fbba0efb4ae64b1abd81d7\",\"name\":\"root\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ko-KR\",\"@id\":\"https:\/\/atmokpo.com\/w\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/708197b41fc6435a7ce22d951b25d4a47e9e904270cb1f04682d4f025066f80c?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/708197b41fc6435a7ce22d951b25d4a47e9e904270cb1f04682d4f025066f80c?s=96&d=mm&r=g\",\"caption\":\"root\"},\"sameAs\":[\"http:\/\/atmokpo.com\/w\"],\"url\":\"https:\/\/atmokpo.com\/w\/author\/root\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Deep Learning PyTorch Course, Preprocessing, Normalization - \ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/atmokpo.com\/w\/36627\/","og_locale":"ko_KR","og_type":"article","og_title":"Deep Learning PyTorch Course, Preprocessing, Normalization - \ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8","og_description":"To effectively train deep learning models, the quality of data is extremely important. Therefore, data preprocessing and normalization are essential processes in deep learning tasks. In this article, we will introduce the importance of data preprocessing and normalization techniques, and explain how to prepare and process exposed data using PyTorch with practical examples. Table of &hellip; \ub354 \ubcf4\uae30 \"Deep Learning PyTorch Course, Preprocessing, Normalization\"","og_url":"https:\/\/atmokpo.com\/w\/36627\/","og_site_name":"\ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8","article_published_time":"2024-11-01T09:50:06+00:00","article_modified_time":"2024-11-01T11:52:27+00:00","author":"root","twitter_card":"summary_large_image","twitter_creator":"@bebubo4","twitter_site":"@bebubo4","twitter_misc":{"\uae00\uc4f4\uc774":"root","\uc608\uc0c1 \ub418\ub294 \ud310\ub3c5 \uc2dc\uac04":"3\ubd84"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/atmokpo.com\/w\/36627\/#article","isPartOf":{"@id":"https:\/\/atmokpo.com\/w\/36627\/"},"author":{"name":"root","@id":"https:\/\/atmokpo.com\/w\/#\/schema\/person\/91b6b3b138fbba0efb4ae64b1abd81d7"},"headline":"Deep Learning PyTorch Course, Preprocessing, Normalization","datePublished":"2024-11-01T09:50:06+00:00","dateModified":"2024-11-01T11:52:27+00:00","mainEntityOfPage":{"@id":"https:\/\/atmokpo.com\/w\/36627\/"},"wordCount":508,"publisher":{"@id":"https:\/\/atmokpo.com\/w\/#organization"},"articleSection":["PyTorch Study"],"inLanguage":"ko-KR"},{"@type":"WebPage","@id":"https:\/\/atmokpo.com\/w\/36627\/","url":"https:\/\/atmokpo.com\/w\/36627\/","name":"Deep Learning PyTorch Course, Preprocessing, Normalization - \ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8","isPartOf":{"@id":"https:\/\/atmokpo.com\/w\/#website"},"datePublished":"2024-11-01T09:50:06+00:00","dateModified":"2024-11-01T11:52:27+00:00","breadcrumb":{"@id":"https:\/\/atmokpo.com\/w\/36627\/#breadcrumb"},"inLanguage":"ko-KR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/atmokpo.com\/w\/36627\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/atmokpo.com\/w\/36627\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"\ud648","item":"https:\/\/atmokpo.com\/w\/en\/"},{"@type":"ListItem","position":2,"name":"Deep Learning PyTorch Course, Preprocessing, Normalization"}]},{"@type":"WebSite","@id":"https:\/\/atmokpo.com\/w\/#website","url":"https:\/\/atmokpo.com\/w\/","name":"\ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8","description":"","publisher":{"@id":"https:\/\/atmokpo.com\/w\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/atmokpo.com\/w\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"ko-KR"},{"@type":"Organization","@id":"https:\/\/atmokpo.com\/w\/#organization","name":"\ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8","url":"https:\/\/atmokpo.com\/w\/","logo":{"@type":"ImageObject","inLanguage":"ko-KR","@id":"https:\/\/atmokpo.com\/w\/#\/schema\/logo\/image\/","url":"https:\/\/atmokpo.com\/w\/wp-content\/uploads\/2024\/11\/logo.png","contentUrl":"https:\/\/atmokpo.com\/w\/wp-content\/uploads\/2024\/11\/logo.png","width":400,"height":400,"caption":"\ub77c\uc774\ube0c\uc2a4\ub9c8\ud2b8"},"image":{"@id":"https:\/\/atmokpo.com\/w\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/bebubo4"]},{"@type":"Person","@id":"https:\/\/atmokpo.com\/w\/#\/schema\/person\/91b6b3b138fbba0efb4ae64b1abd81d7","name":"root","image":{"@type":"ImageObject","inLanguage":"ko-KR","@id":"https:\/\/atmokpo.com\/w\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/708197b41fc6435a7ce22d951b25d4a47e9e904270cb1f04682d4f025066f80c?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/708197b41fc6435a7ce22d951b25d4a47e9e904270cb1f04682d4f025066f80c?s=96&d=mm&r=g","caption":"root"},"sameAs":["http:\/\/atmokpo.com\/w"],"url":"https:\/\/atmokpo.com\/w\/author\/root\/"}]}},"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/posts\/36627","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/comments?post=36627"}],"version-history":[{"count":1,"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/posts\/36627\/revisions"}],"predecessor-version":[{"id":36628,"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/posts\/36627\/revisions\/36628"}],"wp:attachment":[{"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/media?parent=36627"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/categories?post=36627"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/atmokpo.com\/w\/wp-json\/wp\/v2\/tags?post=36627"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}