Python Archives - Thep Excel

pandas drop_duplicates — ลบแถวซ้ำออกจาก DataFrame

Cleaning

5 3 5

drop_duplicates ใน pandas ผมใช้สำหรับลบแถวข้อมูลที่ซ้ำกันออกจาก DataFrame เหมือนกับปุ่ม Remove Duplicates ใน Excel เลยครับ แต่ยืดหยุ่นกว่าตรงที่เราเลือกได้ว่าจะดูซ้ำจากคอลัมน์ไหน และจะเก็บแถวแรกหรือแถวสุดท้ายไว้

Syntax

df.drop_duplicates(subset, keep)

pandas dropna — ลบแถว (หรือคอลัมน์) ที่มีค่าว่างออกจาก DataFrame

Cleaning

5 3 5

dropna ใน pandas ผมใช้สำหรับกำจัดแถวหรือคอลัมน์ที่มีค่า NaN ออกจาก DataFrame ครับ เหมาะมากสำหรับขั้นตอนทำความสะอาดข้อมูลก่อนวิเคราะห์ เพราะค่า NaN แฝงอยู่ในข้อมูลจริงแทบทุกชุด

Syntax

df.dropna(axis, how, subset, inplace)

pandas fillna — เติมค่าที่หายไป (NaN) ใน DataFrame

Cleaning

5 3 5

df.fillna() ใน pandas ผมใช้สำหรับเติมค่า NaN ที่หายไปใน DataFrame ด้วยค่าที่กำหนด เช่น 0, ค่าเฉลี่ย หรือค่าจากแถวก่อนหน้า เป็นขั้นตอนสำคัญในการทำความสะอาดข้อมูลก่อนวิเคราะห์หรือส่งเข้า model ครับ

Syntax

df.fillna(value)

pandas isna — เช็คว่าช่องไหนเป็นค่าว่าง (NaN)

Cleaning

5 3 5

isna ใน pandas ผมใช้สำหรับเช็คว่าแต่ละช่องใน DataFrame มีค่าว่าง (NaN) อยู่หรือเปล่า คืนผลเป็น True/False ทุกช่อง เหมือน ISBLANK ใน Excel เลยครับ แต่ทำได้กับทั้งตารางพร้อมกันในคำสั่งเดียว

Syntax

df.isna()

pandas replace — แทนค่าใน DataFrame/Series งานทำความสะอาดข้อมูล

Cleaning

5 3 5

replace ใน pandas ผมใช้สำหรับแทนที่ค่าหนึ่งด้วยอีกค่าหนึ่งทั้ง DataFrame หรือ Series เหมาะกับงาน cleaning ข้อมูล ใครเคยใช้ Find & Replace (Ctrl+H) ใน Excel มาก่อน บอกเลยว่าตัวนี้คือพี่น้องกันเลยครับ

Syntax

df.replace(to_replace, value)

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

pandas drop_duplicates — ลบแถวซ้ำออกจาก DataFrame

pandas dropna — ลบแถว (หรือคอลัมน์) ที่มีค่าว่างออกจาก DataFrame

pandas fillna — เติมค่าที่หายไป (NaN) ใน DataFrame

pandas isna — เช็คว่าช่องไหนเป็นค่าว่าง (NaN)

pandas replace — แทนค่าใน DataFrame/Series งานทำความสะอาดข้อมูล

เว็บไซต์นี้ใช้คุกกี้ (Cookies)