Links
This article outlines various strategies to optimize Apache Spark performance, focusing on issues like straggler tasks, data skew, and resource allocation. It emphasizes the importance of strategic repartitioning, dynamic resource scaling, and adaptive query execution to enhance job efficiency and reduce bottlenecks.
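The adaptive query execution and dynamic scaling techniques the article describes are largely driven by configuration. As a rough illustration (not taken from the article itself), the relevant Spark 3.x settings in a `spark-defaults.conf` might look like:

```properties
# Enable adaptive query execution: Spark re-optimizes plans at runtime
spark.sql.adaptive.enabled=true
# Automatically split skewed join partitions to avoid straggler tasks
spark.sql.adaptive.skewJoin.enabled=true
# Coalesce small shuffle partitions into fewer, better-sized ones
spark.sql.adaptive.coalescePartitions.enabled=true
# Scale executors up and down with the workload
spark.dynamicAllocation.enabled=true
spark.dynamicAllocation.shuffleTracking.enabled=true
```

These flags address the bottlenecks the article names: skew-join splitting targets stragglers caused by data skew, and dynamic allocation handles resource scaling without manual repartitioning for every job.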
This article discusses how a Q-learning reinforcement learning agent can autonomously optimize Apache Spark configurations based on dataset characteristics. The hybrid approach of combining this agent with Adaptive Query Execution improves performance by adapting settings both before and during job execution. The agent learns from past jobs, allowing for efficient processing across varying workloads without manual tuning.
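The article does not publish its agent's code, but the core Q-learning loop it relies on is standard. A minimal sketch, assuming a hypothetical discretized state (e.g. a dataset-size bucket) and a toy action space of candidate `spark.sql.shuffle.partitions` values, with reward taken as negative job runtime:

```python
import random

# Hypothetical action space: candidate spark.sql.shuffle.partitions settings
ACTIONS = [200, 400, 800]
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.1

q = {}  # Q-table: (state, action) -> expected reward (e.g. negative runtime)

def choose_action(state):
    # Epsilon-greedy: occasionally explore, otherwise pick the best-known setting
    if random.random() < EPSILON:
        return random.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q.get((state, a), 0.0))

def update(state, action, reward, next_state):
    # Standard Q-learning update:
    # Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
    best_next = max(q.get((next_state, a), 0.0) for a in ACTIONS)
    old = q.get((state, action), 0.0)
    q[(state, action)] = old + ALPHA * (reward + GAMMA * best_next - old)
```

After each job run, the agent would call `update` with the observed runtime and then use `choose_action` to pick the configuration for the next similar workload; AQE then refines the plan further during execution, matching the hybrid approach the article describes.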
The article discusses the complexities and challenges of configuring Spark, a popular data processing framework. It surveys the many configuration options and their implications, arguing that the often confusing nature of Spark's settings makes it difficult for users to tune their applications effectively. The author emphasizes that understanding these configurations is essential to harnessing Spark's full potential.
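To give a flavor of the surface area the article is describing, a typical job submission already juggles memory, parallelism, and shuffle settings. The values below are placeholders for illustration, not recommendations from the article:

```shell
spark-submit \
  --conf spark.executor.memory=4g \
  --conf spark.executor.cores=2 \
  --conf spark.sql.shuffle.partitions=400 \
  --conf spark.serializer=org.apache.spark.serializer.KryoSerializer \
  app.py
```

Each of these interacts with the others (e.g. executor memory constrains how many cores per executor are safe, and shuffle partition count trades task overhead against skew exposure), which is exactly the kind of coupling that makes Spark tuning confusing.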
LinkedIn optimized its Sales Navigator search pipeline by migrating from MapReduce to Spark, reducing execution time from 6-7 hours to approximately 3 hours. The optimization involved pruning job graphs, identifying bottlenecks, and addressing data skew across more than 100 data manipulation jobs. This transformation significantly improved how quickly users can access updated search results.