如何利用散点图矩阵进行数据可视化

# Seaborn visualization library
import seaborn as sns
# Create the default pairplot
sns.pairplot(df)

# Take the log of population and gdp_per_capita
df['log_pop'] = np.log10(df['pop'])
df['log_gdp_per_cap'] = np.log10(df['gdp_per_cap'])
# Drop the non-transformed columns
df = df.drop(columns = ['pop', 'gdp_per_cap'])

sns.pairplot(df, hue = 'continent')

# Create a pair plot colored by continent with a density plot of the # diagonal and format the scatter plots.
sns.pairplot(df, hue = 'continent', diag_kind = 'kde',
             plot_kws = {'alpha': 0.6, 's': 80, 'edgecolor': 'k'},
             size = 4)

# Plot colored by continent for years 2000-2007
sns.pairplot(df[df['year'] >= 2000], 
             vars = ['life_exp', 'log_pop', 'log_gdp_per_cap'], 
             hue = 'continent', diag_kind = 'kde', 
             plot_kws = {'alpha': 0.6, 's': 80, 'edgecolor': 'k'},
             size = 4);
# Title 
plt.suptitle('Pair Plot of Socioeconomic Data for 2000-2007', 
             size = 28);

# Create an instance of the PairGrid class.
grid = sns.PairGrid(data= df_log[df_log['year'] == 2007],
                    vars = ['life_exp', 'log_pop', 
                    'log_gdp_per_cap'], size = 4)

# Map a scatter plot to the upper triangle
grid = grid.map_upper(plt.scatter, color = 'darkred')

# Map a histogram to the diagonal
grid = grid.map_diag(plt.hist, bins = 10, color = 'darkred', 
                     edgecolor = 'k')
# Map a density plot to the lower triangle
grid = grid.map_lower(sns.kdeplot, cmap = 'Reds')

# Function to calculate correlation coefficient between two arrays
def corr(x, y, **kwargs):
    # Calculate the value
    coef = np.corrcoef(x, y)[0][1]
    # Make the label
    label = r'$\rho$ = ' + str(round(coef, 2))
    # Add the label to the plot
    ax = plt.gca()
    ax.annotate(label, xy = (0.2, 0.95), size = 20, xycoords = ax.transAxes)
# Create a pair grid instance
grid = sns.PairGrid(data= df[df['year'] == 2007],
                    vars = ['life_exp', 'log_pop', 'log_gdp_per_cap'], size = 4)

推荐文章

性感的西装 · SPSS字符串怎么改成数值型 SPSS字符串怎么变数值-IBM SPSS Statistics 中文网站

2 周前

刚毅的莴苣 · scatter3 - 三维散点图 - MATLAB

2 周前

性感的橙子 · Matlab散点图(多个y值)_Matlab散点图集x轴和y轴_在python中绘制多个Y轴+ 'hue‘散点图 - 腾讯云开发者社区 - 腾讯云

3 周前

飘逸的烈马 · Matplotlib绘制炫酷散点图：从二维到三维，再到散点图矩阵的完整指南与实战【第58篇—python：Matplotlib绘制炫酷散点图】_Python资料_Python教程开发文档资料-Pyth

4 周前

重感情的椰子 · 使用 plt.scatter() 在 Python 中可视化数据【生长吧！Python!】-云社区-华为云

1 月前

爱听歌的土豆 · (转)下载网页中的SVG矢量图标文件_svg图标文件-CSDN博客

3 周前

坏坏的小狗 · 航天爱斯诺-室内滑雪机

1 月前

读研的盒饭 · INITCAP 函数 - Amazon Redshift

3 月前

侠义非凡的大象 · 中国在非企业社会责任联盟“春之约”招待会暨“百企千村”活动在非履责活动交流会在京成功举办 - Alliance of Chinese Business in Africa for Social Res

3 月前

暗恋学妹的煎鸡蛋 · Amazon Athena Cloudera Impala 连接器 - Amazon Athena

5 月前