Posts

Showing posts from 2022

Access to Azure blobs from Azure Function by using Managed Identity

How can we grant access to blob files in an Azure Storage Account without providing a SAS token or an Access Key? In some cases this is helpful, especially when access to the Azure Storage Account is available via Managed Identity. In that case, we don't want to use any explicitly provided secrets. In fact, we still want to use a SAS token behind the scenes, but one generated on the fly, with a very short lifetime. Every SAS token has to be signed with a key. We could use one of the two access keys provided by the Storage Account, but that's not a good option, as we don't want to deal with them. Azure lets us generate a temporary access key, based on our credentials, and then use it to generate a SAS token. In that case, we need to: create a Storage Account; create an Azure Function App; assign the Storage Blob Delegator (or Storage Blob Data Owner) role to the Managed Identity of the Function App. The action needs to be performed ...
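The flow described above can be sketched in Python roughly as follows. This is a minimal sketch, not the post's own implementation: it assumes the `azure-identity` and `azure-storage-blob` packages, and the helper names (`sas_validity_window`, `blob_url_with_sas`, `create_read_sas_url`), the 5-minute lifetime, and the 1-minute clock-skew margin are illustrative choices. The temporary key is what Azure calls a user delegation key.

```python
from datetime import datetime, timedelta, timezone


def sas_validity_window(now, minutes=5, skew_minutes=1):
    """Return (start, expiry) for a short-lived token.

    The start is backdated slightly to tolerate clock skew (an assumption,
    not something the post prescribes).
    """
    return now - timedelta(minutes=skew_minutes), now + timedelta(minutes=minutes)


def blob_url_with_sas(account, container, blob, sas_token):
    """Append a SAS query string to the blob URL (names assumed URL-safe)."""
    return f"https://{account}.blob.core.windows.net/{container}/{blob}?{sas_token}"


def create_read_sas_url(account, container, blob, minutes=5):
    """Build a read-only, short-lived SAS URL without any stored secret."""
    # Imported lazily so the helpers above work without the Azure SDK installed.
    from azure.identity import DefaultAzureCredential
    from azure.storage.blob import (
        BlobSasPermissions,
        BlobServiceClient,
        generate_blob_sas,
    )

    start, expiry = sas_validity_window(datetime.now(timezone.utc), minutes)

    # DefaultAzureCredential picks up the Function App's Managed Identity
    # when running in Azure.
    service = BlobServiceClient(
        account_url=f"https://{account}.blob.core.windows.net",
        credential=DefaultAzureCredential(),
    )

    # Temporary key issued for our identity; requires a role that allows
    # generating user delegation keys, such as Storage Blob Delegator.
    delegation_key = service.get_user_delegation_key(
        key_start_time=start, key_expiry_time=expiry
    )

    # Sign the SAS with the temporary key instead of an account Access Key.
    sas_token = generate_blob_sas(
        account_name=account,
        container_name=container,
        blob_name=blob,
        user_delegation_key=delegation_key,
        permission=BlobSasPermissions(read=True),
        start=start,
        expiry=expiry,
    )
    return blob_url_with_sas(account, container, blob, sas_token)
```

Calling `create_read_sas_url("mystorageaccount", "mycontainer", "report.csv")` (all three names are placeholders) would return a URL that stops working a few minutes later, so nothing long-lived ever needs to be stored in the Function App settings.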

Azure Data Scientist Associate

Just last week, I renewed my Azure Data Scientist Associate certification. Before the first renewal attempt, I promised myself to spend some time on preparation. Was it worth it? Of course! I love it. Preparing for a certification forces me to take a closer look at aspects that are outside my daily interests. In my current project, only a limited part of Azure ML was in use - Data Drift, which is a small piece of a bigger whole. During my study, I walked through the whole journey once again, starting with data preparation and ending with model deployment in the cloud. This solved the mystery of why, for the past year, I hadn't been able to convince my client to go deeper into that platform, despite the fact that the project itself is data science-driven and already includes all the juicy parts, like model training (PKL) and data/feature preparation. The answer was simple. Following the Bible - "And how can they hear about som...

Podman using Ubuntu WSL2

Docker Desktop for Windows is no longer free for commercial use, at least in many cases. If your local development environment is Windows, and you're looking for an alternative, this article is for you. "Docker Desktop remains free for small businesses (fewer than 250 employees AND less than $10 million in annual revenue), personal use, education, and non-commercial open-source projects." Unfortunately, containerization is the foundation of most enterprise-level projects these days, and the size of those enterprises is what triggers the licence issue. The commercial licence doesn't cost an arm and a leg, but it still creates unnecessary concerns and limitations. Even if you use a WSL2 Linux image, you need to use the Docker engine, which is a part of Docker Desktop for Windows.

Podman as Docker

And here is the beautiful moment when Podman comes into play. It's a rootless and daemonless solution, fully compatible with Docker. If you have ever been concerned about the daemon working in the...

Azure Purview - to own your data

While exploring the problem of migration to the cloud, I realized that it would be worth shedding some light on the issue of data ownership. This is where Azure Purview comes in. I'm not a Chief Data Officer, but if I were one, I would be asked some fundamental questions about my data - what is the source, what does the ETL/ELT process look like, who owns the data ... and, what is the data lineage..., and finally what all that data means. All those questions are very important these days. They might be triggered by regulatory requirements (data lineage, anonymisation), data scientists' implementations (Where is my source dataset used to train the model?), internal auditors, or the business itself, which has struggled for years to define common domain models. While working for one of my clients, I faced the problem of defining the Product Master application, which would be a golden source of domain model definitions across the company. The issue was too big to be ...