Hello,
I want to get metadata from my Azure data lake using their Blob API.
I wasn't seeing any data in the Ingest storage, so I turned on caching to file to see what's happening.
There are three files in my caching folder:
- Data_.raw: The return of the call, i.e. my actual data. This looks excellent, except that it's a .raw file. Contents:
<?xml version="1.0" encoding="utf-8"?>
<EnumerationResults ServiceEndpoint="https://xxxx.blob.core.windows.net/" ContainerName="datalake">
<Prefix>my_prefix</Prefix>
<Blobs>
<Blob>
....
</Blob>
</Blobs>
<NextMarker/>
</EnumerationResults>
- Data_.xml: Basically the same as Data_.raw, but with the content of Data_.raw as the data of a value element. The data also contains the XML header (so now the document has two headers) and the angle brackets have been encoded (i.e. all the `<` are now `&lt;`). Contents:
<?xml version="1.0" encoding="utf-8"?>
<Table_flattening_name>
<value>
<?xml version="1.0" encoding="utf-8"?>
<EnumerationResults
ServiceEndpoint="https://xxxx.blob.core.windows.net/"
ContainerName="datalake">
<Prefix>my_prefix</Prefix>
<Blobs>
<Blob>
...
</Blob>
</Blobs>
<NextMarker />
</EnumerationResults>
</value>
</Table_flattening_name>
- Data_transformed_1.xml: The result of my XSLT on Data_.xml
Data_transformed_1.xml contains one empty element, which is caused by Data_.xml being malformed.
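To make the symptom concrete, here is a minimal sketch in Python (standard library only) of what I believe is happening. It is not the tool's actual code: the `raw` and `wrapped` strings are hypothetical stand-ins for Data_.raw and Data_.xml, and the empty `<Blob/>` is a placeholder for the blob entries I left out above. It shows that an escaped payload is just text inside `value`, so a path into `value/EnumerationResults` matches nothing, while parsing that text a second time recovers the real document.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

# Hypothetical stand-in for Data_.raw (blob entries replaced by an empty placeholder).
raw = (
    '<?xml version="1.0" encoding="utf-8"?>'
    '<EnumerationResults ServiceEndpoint="https://xxxx.blob.core.windows.net/" '
    'ContainerName="datalake">'
    '<Prefix>my_prefix</Prefix><Blobs><Blob/></Blobs><NextMarker/>'
    '</EnumerationResults>'
)

# Hypothetical stand-in for Data_.xml: the raw response, escaped, as text of <value>.
wrapped = (
    '<?xml version="1.0" encoding="utf-8"?>'
    '<Table_flattening_name><value>' + escape(raw) + '</value></Table_flattening_name>'
)

root = ET.fromstring(wrapped.encode("utf-8"))
value = root.find("value")

# The escaped payload is a single text node, so <value> has no child elements
# and a path that expects EnumerationResults under it finds nothing.
child_count = len(value)
direct_match = root.find("value/EnumerationResults")

# Parsing the text content a second time yields the real document.
inner = ET.fromstring(value.text.encode("utf-8"))

print(child_count, direct_match, inner.tag, inner.findtext("Prefix"))
```

If this is indeed the cause, my XSLT would be selecting against a text node rather than elements, which would explain the single empty element in the output.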
I can't really figure out what's going on. In other APIs I only had two files. Not sure what the Data_.raw file is doing, but everything would work if that file were Data_.xml.
What could be causing this? Why is there a Data_.raw file? How can I fix this?