PySpark: how to initialize common DataFrameReader options separately?

368yc8dk posted on 2021-05-27 in Spark

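I read data many times with the same options. Is there a way to avoid repeating the common DataFrameReader options, and instead initialize them once, separately, so they can be reused for every later read?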

```
metrics_df = spark.read.format("jdbc") \
        .option("driver", self.driver) \
        .option("url", self.url) \
        .option("user", self.username) \
        .option("password", self.password) \
        .load()
```

zwghvu4y1#

Define all the common options once on a DataFrameReader (`<class 'pyspark.sql.readwriter.DataFrameReader'>`), then add the `dbtable` option on each read to reuse that reader. Example:

```
# Build the reader once with the options shared by every read.
# The parentheses let the chained calls span several lines.
metrics_df_options = (
    spark.read.format("jdbc")
    .option("driver", self.driver)
    .option("url", self.url)
    .option("user", self.username)
    .option("password", self.password)
)

type(metrics_df_options)
# <class 'pyspark.sql.readwriter.DataFrameReader'>

# Configure dbtable and pull data from the RDBMS table
metrics_df_options.option("dbtable", "<table_name>").load().show()
```

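An alternative worth considering is to keep the shared options in a plain dict and build a fresh reader for each read. In PySpark, `option()` returns the same reader object it was called on, so options set for one read stay set on a shared reader for the next one. The sketch below is one possible way to avoid that. The helper names `jdbc_options` and `read_table` are hypothetical and not part of PySpark; it assumes `spark` is an existing `SparkSession` and uses only `DataFrameReader.format`, `options`, `option`, and `load`.

```python
def jdbc_options(driver, url, user, password):
    """Collect the connection options shared by every JDBC read (hypothetical helper)."""
    return {
        "driver": driver,
        "url": url,
        "user": user,
        "password": password,
    }


def read_table(spark, common_options, table_name):
    """Read one table with the shared options (hypothetical helper).

    `spark.read` is accessed on every call, so each read starts from a
    new DataFrameReader and no option set here carries over to other reads.
    """
    return (
        spark.read.format("jdbc")
        .options(**common_options)      # apply all shared options at once
        .option("dbtable", table_name)  # the only per-read setting
        .load()
    )
```

With this in place, a read might look like `metrics_df = read_table(spark, jdbc_options(self.driver, self.url, self.username, self.password), "<table_name>")`. Keeping the options as a dict also makes them easy to log (minus the password) or to extend with further JDBC settings in one place.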