ADF degree of parallelism

Inquisitive 60 Reputation points
2025-12-09T16:38:17.35+00:00

hello sir, I have 10 tables to fetch data from, and I have lookup and foreach activity to iterate through each table with single copy activity and set degree of parallelism to 5 to concurrent data process. I have also set foreach batch count to 15. How would degree of parallelism work in this scenario as degree of parallelism and batch count is in contention. Thank you

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
0 comments No comments
{count} votes

Answer accepted by question author
  1. Pilladi Padma Sai Manisha 575 Reputation points Microsoft External Staff Moderator
    2025-12-09T17:30:02.0866667+00:00

    Hi Inquisitive,

    It sounds like you're trying to manage the degree of parallelism in Azure Data Factory (ADF) for your copy and foreach activities, and you're curious about how these settings interact when fetching data from multiple tables.

    Here's a bit of insight:

    1. Degree of Parallelism: This setting controls how many concurrent copies ADF will attempt to run. In your case, you mentioned setting it to 5. This means ADF can process up to 5 concurrent copy operations at any given time. However, be mindful that if this number is set too high, it may potentially lead to performance issues or even throttling, especially if you're working with sources that have a maximum limit (like 32 queries at once).
    2. ForEach Batch Count: The batch count decides how many items are processed in parallel within the foreach activity. You've set this to 15. What this means is that ADF can initiate up to 15 parallel 'queues', but remember that these queues run sequentially in terms of the items they process.

    In your scenario:

    • You can run multiple copy activities simultaneously, but you're capped by both your degree of parallelism (5) and how the foreach activity constructs its queues (up to 15). So, effectively, while you have the capability for many queues, the actual number of concurrent activities will be limited by the degree of parallelism you've set.

    Recommendations:

    Monitor Performance: Keep an eye on the performance metrics to ensure that your pipeline is running efficiently and adjust the degree of parallelism if you notice any bottlenecks.

    Testing: Test different configurations to find the sweet spot that works best for your workload.

    Feel free to reach out if you need more specific guidance or have further questions!

    References:

    Hope this helps!

    0 comments No comments

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.