Category: Deliver Data in Parquet Files

Implement a Partition Strategy for Analytical Workloads – The Storage of Data

When you begin to brainstorm the storage of data for an analytical workload, terms such as hybrid transaction/analytical processing (HTAP) and online analytical processing (OLAP) might come to mind. Both [...]

Azure Synapse Analytics Data Hub Data Flow – The Storage of Data

DROP TABLE brainwaves.DimELECTRODE

2. Create an SCD table, and then execute the following SQL script, which is named createSlowlyChangingDimensionTable.sql and is located in the folder Chapter04/Ch04Ex09 on GitHub at https://github.com/benperk/ADE:

CREATE [...]

Build External Tables on a Serverless SQL Pool – The Storage of Data

COLLATE Latin1_General_100_BIN2_UTF8
WITH (LOCATION = 'abfss://@.dfs.core.windows.net')
WITH (FORMAT_TYPE = PARQUET)
([Timestamp] NVARCHAR(50), [AF3theta] NVARCHAR(50), [AF3alpha] NVARCHAR(50), [AF3betaL] NVARCHAR(50), …)
WITH (LOCATION = 'EMEA/brainjammer/out/2022/04/03//.parquet/*', DATA_SOURCE = SampleBrainwavesSource, FILE_FORMAT = SampleBrainwavesParquet)

FIGURE 4.33 Building an external table

You might [...]
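The excerpt above shows only the WITH clauses of the statements, so it can be hard to see how the column list, location, data source, and file format fit together. The following is a minimal Python sketch that renders a CREATE EXTERNAL TABLE statement from those pieces. The function name render_external_table and the table name SampleBrainwaves are hypothetical and do not come from the original post; the data source and file format names are the ones shown in the excerpt, and the location is copied as it appears there, elided segments included.

```python
from typing import Sequence, Tuple


def render_external_table(
    table: str,
    columns: Sequence[Tuple[str, str]],
    location: str,
    data_source: str,
    file_format: str,
) -> str:
    """Render a T-SQL CREATE EXTERNAL TABLE statement as text.

    Hypothetical helper for illustration only; it builds a string and
    does not connect to a serverless SQL pool.
    """
    # Bracket each column name, as the excerpt does, and pair it with its type.
    col_sql = ",\n    ".join(f"[{name}] {sql_type}" for name, sql_type in columns)
    # Double any single quotes so the location stays a valid SQL string literal.
    safe_location = location.replace("'", "''")
    return (
        f"CREATE EXTERNAL TABLE {table} (\n"
        f"    {col_sql}\n"
        f")\n"
        f"WITH (\n"
        f"    LOCATION = '{safe_location}',\n"
        f"    DATA_SOURCE = {data_source},\n"
        f"    FILE_FORMAT = {file_format}\n"
        f");"
    )


ddl = render_external_table(
    table="SampleBrainwaves",  # hypothetical table name
    columns=[
        ("Timestamp", "NVARCHAR(50)"),
        ("AF3theta", "NVARCHAR(50)"),
        ("AF3alpha", "NVARCHAR(50)"),
        ("AF3betaL", "NVARCHAR(50)"),
    ],
    # Copied verbatim from the excerpt; the elided path segments are left as-is.
    location="EMEA/brainjammer/out/2022/04/03//.parquet/*",
    data_source="SampleBrainwavesSource",
    file_format="SampleBrainwavesParquet",
)
print(ddl)
```

Only four of the columns are listed here because the excerpt truncates the rest. The data source and file format referenced by name would need to exist already, created by the statements whose WITH (LOCATION = ...) and WITH (FORMAT_TYPE = PARQUET) clauses appear in the excerpt.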

Azure Synapse Analytics Develop Hub Notebook – The Storage of Data

df = spark.read.option("header", "true") \
    .csv('abfss://@.dfs.core.windows.net/EMEA/brainjammer/in/2022/04/01/18/*')
display(df.limit(10))

FIGURE 4.21 Azure Synapse Analytics Develop hub load Notebook

FIGURE 4.22 Azure Synapse Analytics Develop hub write Notebook Parquet files

In this exercise you [...]
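The path in the notebook snippet follows a year/month/day/hour folder layout under EMEA/brainjammer/in. As a rough sketch of how such a path could be assembled rather than typed by hand, the helper below builds the abfss URI from a timestamp. The function name hourly_input_path and the container and account values are hypothetical placeholders, since the excerpt leaves both out of the URI.

```python
from datetime import datetime


def hourly_input_path(
    container: str,
    account: str,
    when: datetime,
    root: str = "EMEA/brainjammer/in",
) -> str:
    """Build an abfss URI for one hour's folder of input files.

    Hypothetical helper: the folder layout (yyyy/MM/dd/HH) mirrors the
    path in the notebook snippet; container and account are supplied by
    the caller.
    """
    # strftime-style codes inside the format spec yield e.g. 2022/04/01/18.
    partition = f"{when:%Y/%m/%d/%H}"
    return f"abfss://{container}@{account}.dfs.core.windows.net/{root}/{partition}/*"


# "mycontainer" and "myaccount" are placeholder names, not values from the post.
path = hourly_input_path("mycontainer", "myaccount", datetime(2022, 4, 1, 18))
print(path)

# In a Synapse notebook, where a `spark` session is predefined, the path
# could then be passed to the reader shown in the snippet:
#   df = spark.read.option("header", "true").csv(path)
```

Deriving the folder from a timestamp keeps the reader and writer notebooks consistent with each other when the same layout is used for the output location.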