{"author_url":"https://blog.hatena.ne.jp/yohei-a/","url":"https://yohei-a.hatenablog.jp/entry/20210612/1623474860","description":"An example of replacing newline characters inside a CSV column with Glue PySpark. In Spark, regular expressions use Java syntax. newDf = df.withColumn(\"col2\", regexp_replace(col(\"col2\"), \"\\\\n|\\\\r\", \" \")) Full sample code: import sys from awsglue.transforms import * from awsglue.utils import getResolvedOptions from pyspark.context import SparkContext from awsglue.context \u2026","blog_url":"https://yohei-a.hatenablog.jp/","author_name":"yohei-a","published":"2021-06-12 14:14:20","version":"1.0","height":"190","width":"100%","type":"rich","categories":["Spark"],"title":"Replacing newline characters inside a CSV column with Glue PySpark","provider_url":"https://hatena.blog","blog_title":"ablog","html":"<iframe src=\"https://hatenablog-parts.com/embed?url=https%3A%2F%2Fyohei-a.hatenablog.jp%2Fentry%2F20210612%2F1623474860\" title=\"Glue PySpark \u3067 CSV \u306e\u30ab\u30e9\u30e0\u5185\u306e\u6539\u884c\u30b3\u30fc\u30c9\u3092\u7f6e\u63db\u3059\u308b - ablog\" class=\"embed-card embed-blogcard\" scrolling=\"no\" frameborder=\"0\" style=\"display: block; width: 100%; height: 190px; max-width: 500px; margin: 10px 0px;\"></iframe>","image_url":null,"provider_name":"Hatena Blog"}