<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[“Getting Started with Data Analysis"]]></title><description><![CDATA[“Getting Started with Data Analysis"]]></description><link>https://blogs.sdipin.com.np</link><generator>RSS for Node</generator><lastBuildDate>Sat, 18 Apr 2026 01:31:00 GMT</lastBuildDate><atom:link href="https://blogs.sdipin.com.np/rss.xml" rel="self" type="application/rss+xml"/><language><![CDATA[en]]></language><ttl>60</ttl><item><title><![CDATA[Data Analysis using python Pandas , Numpy, and Matplotlib]]></title><description><![CDATA[🔧 Tools Covered:

Python

Pandas

NumPy

Matplotlib


1.Introduction
Data is everywhere—and making sense of it is the real challenge. In this post, I’ll walk you through how I analyzed real-world datasets using Python libraries like Pandas, NumPy, a...]]></description><link>https://blogs.sdipin.com.np/data-analysis-using-python-pandas-numpy-and-matplotlib</link><guid isPermaLink="true">https://blogs.sdipin.com.np/data-analysis-using-python-pandas-numpy-and-matplotlib</guid><dc:creator><![CDATA[Dipin Silwal]]></dc:creator><pubDate>Wed, 02 Jul 2025 05:54:28 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/stock/unsplash/8mikJ83LmSQ/upload/f30f947b8de490cf0be6a53126443db3.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h3 id="heading-tools-covered">🔧 Tools Covered:</h3>
<ul>
<li><p>Python</p>
</li>
<li><p>Pandas</p>
</li>
<li><p>NumPy</p>
</li>
<li><p>Matplotlib</p>
</li>
</ul>
<h3 id="heading-1introduction">1.<strong>Introduction</strong></h3>
<p>Data is everywhere—and making sense of it is the real challenge. In this post, I’ll walk you through how I analyzed real-world datasets using Python libraries like Pandas, NumPy, and Matplotlib. Whether you're a beginner or brushing up your skills, this guide will help you get started.</p>
<h3 id="heading-2-loading-and-exploring-the-dataset"><strong>2. Loading and Exploring the Dataset</strong></h3>
<p>Let’s begin by importing the necessary libraries and loading the dataset.</p>
<pre><code class="lang-python"><span class="hljs-keyword">import</span> pandas <span class="hljs-keyword">as</span> pd
<span class="hljs-keyword">import</span> numpy <span class="hljs-keyword">as</span> np
<span class="hljs-keyword">import</span> matplotlib.pyplot <span class="hljs-keyword">as</span> plt

df = pd.read_csv(<span class="hljs-string">'netflix_titles.csv'</span>)
df.head()
</code></pre>
<p>This gives us a first look at the structure of our data.</p>
<hr />
<h3 id="heading-3-data-cleaning">3. <strong>Data Cleaning</strong></h3>
<pre><code class="lang-markdown">Now, let’s clean the data by handling missing values and checking for duplicates.

<span class="hljs-code">```python
df.isnull().sum()
df.drop_duplicates(inplace=True)</span>
</code></pre>
<p>We removed duplicates and can now deal with missing values accordingly.</p>
<hr />
<h3 id="heading-4-data-analysis">4. <strong>Data Analysis</strong></h3>
<pre><code class="lang-markdown">Let’s answer some interesting questions like:

<span class="hljs-bullet">-</span> What’s the most popular genre?
<span class="hljs-bullet">-</span> How many shows were released each year?

<span class="hljs-code">```python
df['release_year'] = pd.DatetimeIndex(df['date_added']).year
df['release_year'].value_counts().sort_index().plot(kind='bar', figsize=(10,4))
plt.title("Content Released Per Year")
plt.xlabel("Year")
plt.ylabel("Number of Titles")
plt.grid(True)
plt.tight_layout()
plt.show()</span>
</code></pre>
<hr />
<h3 id="heading-5-visualizations">5. <strong>Visualizations</strong></h3>
<pre><code class="lang-markdown">Visualization makes data stories clearer. Here’s a pie chart showing the distribution of content types:

<span class="hljs-code">```python
df['type'].value_counts().plot(kind='pie', autopct='%1.1f%%', colors=['#66b3ff','#ff9999'])
plt.title("Distribution of Content Types")
plt.ylabel("")
plt.show()</span>
</code></pre>
<hr />
<h3 id="heading-6-conclusion">6. <strong>Conclusion</strong></h3>
<pre><code class="lang-markdown">This project helped me understand the power of Python for data analysis. Libraries like Pandas, NumPy, and Matplotlib simplify the process from cleaning to visualizing. I’ll continue sharing more deep-dives into real-world datasets, so stay tuned!
</code></pre>
]]></content:encoded></item></channel></rss>