Hello,
I want to get metadate from my Azure datalake using their Blob API.
I wasn't seeing any data in the Ingest storage so I turned on cashing to file, to try to see what's happening.
There are three files in my cashing folder:
- Data_.raw: The return of the call, i.e. my actual data. This look excellent, except that it's a .raw file. Contents:
<?xml version="1.0" encoding="utf-8"?> <EnumerationResults ServiceEndpoint="https://xxxx.blob.core.windows.net/" ContainerName="datalake"> <Prefix>my_prefix</Prefix> <Blobs> <Blob> .... </Blob> </Blobs> <NextMarker/> </EnumerationResults>
- Data_.xml: Basically the same as the Data_.raw, but with the content of Data_.raw as the data of a value-element. The data also contains the XML header (so now the document has two headers) and the brackets have been encoded (i.e. all the `<` are now `<`).
<?xml version="1.0" encoding="utf-8"?> <Table_flattening_name> <value> <?xml version="1.0" encoding="utf-8"?> <EnumerationResults ServiceEndpoint="https://xxxx.blob.core.windows.net/" ContainerName="datalake"> <Prefix>my_prefix</Prefix> <Blobs> <Blob> ... </Blob> </Blobs> <NextMarker /> </EnumerationResults> </value> </Table_flattening_name>
-
Data_transformed_1.xml: The result of my XSLT on Data_.xml
Data_transformed_1.xml contains one empty element, which is caused by Data_.xml being malformed.
I can't really figure out what's going on. In other APIs I only had two files. Not sure what the Data_.raw file is doing, but everything would work if that file were Data_.xml.
What could be causing this? Why is there a Data_.raw file? How can I fix this?