728x90
문제
Table: Activity
+--------------+---------+
| Column Name | Type |
+--------------+---------+
| player_id | int |
| device_id | int |
| event_date | date |
| games_played | int |
+--------------+---------+
(player_id, event_date) is the primary key (combination of columns with unique values) of this table.
This table shows the activity of players of some games.
Each row is a record of a player who logged in and played a number of games (possibly 0) before logging out on someday using some device.
Write a solution to find the first login date for each player.
https://leetcode.com/problems/game-play-analysis-i/description/?lang=pythondata
예시
Input:
Activity table:
+-----------+-----------+------------+--------------+
| player_id | device_id | event_date | games_played |
+-----------+-----------+------------+--------------+
| 1 | 2 | 2016-03-01 | 5 |
| 1 | 2 | 2016-05-02 | 6 |
| 2 | 3 | 2017-06-25 | 1 |
| 3 | 1 | 2016-03-02 | 0 |
| 3 | 4 | 2018-07-03 | 5 |
+-----------+-----------+------------+--------------+
Output:
+-----------+-------------+
| player_id | first_login |
+-----------+-------------+
| 1 | 2016-03-01 |
| 2 | 2017-06-25 |
| 3 | 2016-03-02 |
+-----------+-------------+
문제 풀이
#1
import pandas as pd
def game_analysis(activity: pd.DataFrame) -> pd.DataFrame:
# event_date 기준 정렬
activity.sort_values(by='event_date', inplace=True)
# player_id 중복값 제거 keep 매개 변수로 min값 남기기
filtered = activity.drop_duplicates(subset='player_id', keep = 'first')
# column명 변경
filtered.rename(columns={'event_date':'first_login'}, inplace=True)
result = filtered.loc[:,['player_id', 'first_login']]
return result
#2
import pandas as pd
def game_analysis(activity: pd.DataFrame) -> pd.DataFrame:
# player_id 기준 그룹화, event_date min값 가져오고 정수 인덱스 설정
grouped = activity.groupby('player_id')['event_date'].min().reset_index()
# column명 변경
grouped.rename(columns={'event_date':'first_login'}, inplace = True)
return grouped
파이썬을 독학하시는 분들에게 도움이 되길 바라며,
혹 더 좋은 방법이 있거나 오류가 있다면 편하게 말씀 부탁드립니다.
728x90