feat: démarrer avec apache spark #1131

Merged

ElevenTom merged 11 commits into master from feat-demarrer-apache-spark

Jul 8, 2024

Member

lepiaf commented Jun 26, 2024

No description provided.

lepiaf added 4 commits

June 26, 2024 16:37


          Démarrer avec Apache Spark

c976849


          Update 2024-07-12-demarrer-apache-spark.md

5e91d54


          Update 2024-07-12-demarrer-apache-spark.md

7f9c768


          Update 2024-07-12-demarrer-apache-spark.md

5b6418d

lepiaf added the publication label

lepiaf temporarily deployed to 1131/merge

June 26, 2024 16:28

— with

GitHub Actions Inactive


          Update 2024-07-12-demarrer-apache-spark.md

d0f2871

lepiaf temporarily deployed to 1131/merge

June 26, 2024 16:32

— with

GitHub Actions Inactive

github-actions bot temporarily deployed to feat-demarrer-apache-spark

June 26, 2024 16:32

Destroyed

github-actions bot temporarily deployed to feat-demarrer-apache-spark

June 26, 2024 16:36

Destroyed

Contributor

github-actions bot commented Jun 26, 2024 (edited)

⚡️🏠 Lighthouse report

Here's the summary:

| Path | Performance | Accessibility | Best practices | SEO | PWA |
| --- | --- | --- | --- | --- | --- |
| /feat-demarrer-apache-spark/ | 🟢 92 | 🟢 90 | 🟢 100 | 🟢 92 | 🟠 70 |
| /feat-demarrer-apache-spark/fr/authors/ajacquemin/ | 🟠 71 | 🟢 90 | 🟢 100 | 🟢 92 | 🟠 70 |
| /feat-demarrer-apache-spark/fr/comment-construire-site-web-avec-nextjs/ | 🟠 76 | 🟠 80 | 🟢 100 | 🟢 100 | 🟠 70 |
| /feat-demarrer-apache-spark/fr/nestjs-le-cycle-de-vie-dune-requete/ | 🟠 73 | 🟠 80 | 🟢 100 | 🟢 97 | 🟠 70 |

Here are the audits:

| Path | FCP (≤ 1800 ms) | LCP (≤ 2500 ms) | Speed Index (≤ 3400 ms) | TTI (≤ 3800 ms) | TBT (≤ 200 ms) | CLS (≤ 0.1) |
| --- | --- | --- | --- | --- | --- | --- |
| /feat-demarrer-apache-spark/ | 🔴 1864 | 🟢 1864 | 🟢 1864 | 🔴 4134 | 🟢 8 | 🔴 0.12 |
| /feat-demarrer-apache-spark/fr/authors/ajacquemin/ | 🔴 2452 | 🟢 2452 | 🟢 2736 | 🔴 4175 | 🟢 8 | 🟢 0.01 |
| /feat-demarrer-apache-spark/fr/comment-construire-site-web-avec-nextjs/ | 🔴 2425 | 🔴 2621 | 🟢 2425 | 🔴 4128 | 🟢 8 | 🟢 0.03 |
| /feat-demarrer-apache-spark/fr/nestjs-le-cycle-de-vie-dune-requete/ | 🔴 1989 | 🔴 2892 | 🟢 2423 | 🔴 3960 | 🟢 8 | 🟢 0.04 |

lepiaf added 2 commits

June 26, 2024 18:41


          Update 2024-07-12-demarrer-apache-spark.md

ec8bbde


          Update 2024-07-12-demarrer-apache-spark.md

1db0d52

lepiaf temporarily deployed to 1131/merge

June 26, 2024 16:43

— with

GitHub Actions Inactive

github-actions bot temporarily deployed to feat-demarrer-apache-spark

June 26, 2024 16:46

Destroyed

github-actions bot temporarily deployed to feat-demarrer-apache-spark

June 26, 2024 16:47

Destroyed

lepiaf changed the title ~~Feat demarrer apache spark~~ feat: démarrer avec apache spark

lepiaf requested a review from ElevenTom

June 28, 2024 10:29

lepiaf added the status/reviewable label


          Update 2024-07-12-demarrer-apache-spark.md

0d350f8

lepiaf temporarily deployed to 1131/merge

June 28, 2024 14:41

— with

GitHub Actions Inactive

github-actions bot temporarily deployed to feat-demarrer-apache-spark

June 28, 2024 14:45

Destroyed

Cindyvlv reviewed

_articles/fr/2024-07-12-demarrer-apache-spark.md Outdated

+              excerpt: >-
+                Le domaine de la data est présent au quotidient. La quantité de donnée est si grande que nous la nommons Big Data.
+                Dans cet article, nous verrons comment traiter ce volume de données à l'aide du framework Apache Spark.
+              categories: []

Contributor

Cindyvlv Jul 5, 2024

Suggested change

-            categories: []
+            categories: [architecture]

_articles/fr/2024-07-12-demarrer-apache-spark.md Outdated

+              categories: []
+              authors:
+                - tthuon
+              keywords: []

Contributor

Cindyvlv Jul 5, 2024

Suggested change

-            keywords: []
+            keywords: [
+            - apache spark
+            - data
+            - big data
+            ]

_articles/fr/2024-07-12-demarrer-apache-spark.md Outdated

+              slug: demarrer-apache-spark
+              title: Démarrer avec Apache Spark
+              excerpt: >-
+                Le domaine de la data est présent au quotidient. La quantité de donnée est si grande que nous la nommons Big Data.

Contributor

Cindyvlv Jul 5, 2024

Suggested change

-              Le domaine de la data est présent au quotidient. La quantité de donnée est si grande que nous la nommons Big Data.
+              Le domaine de la data est présent au quotidien. La quantité de donnée est si grande que nous la nommons Big Data.

_articles/fr/2024-07-12-demarrer-apache-spark.md Outdated

+              keywords: []
+              ---
+              Lorsque l'on travaille dans l'univers de la data, nous effectuons principalements sur ces trois étapes :

Contributor

Cindyvlv Jul 5, 2024

Suggested change

-            Lorsque l'on travaille dans l'univers de la data, nous effectuons principalements sur ces trois étapes :
+            Lorsque l'on travaille dans l'univers de la data, nous effectuons principalement sur ces trois étapes :

_articles/fr/2024-07-12-demarrer-apache-spark.md Outdated

+              ---
+              Lorsque l'on travaille dans l'univers de la data, nous effectuons principalements sur ces trois étapes :
+              - extraire la données de la source

Contributor

Cindyvlv Jul 5, 2024

Suggested change

-            - extraire la données de la source
+            - extraire la donnée de la source

_articles/fr/2024-07-12-demarrer-apache-spark.md Outdated


		Par simplicité, nous nommerons Spark pour désigner Apache Spark.

		## Mise en situation

Contributor

Cindyvlv Jul 5, 2024

Suggested change

-            ## Mise en situation
+            ## Etape 1 : Récupération d'une source de données

_articles/fr/2024-07-12-demarrer-apache-spark.md Outdated

+;Pont Haudaudine vers Sud;589;;5;0674 - Pont Haudaudine vers Sud;2021-03-26;Hors Vacances
+              ```
+              ## Installation d'Apache Spark

Contributor

Cindyvlv Jul 5, 2024

Suggested change

-            ## Installation d'Apache Spark
+            ## Etape 2 : Installation d'Apache Spark

_articles/fr/2024-07-12-demarrer-apache-spark.md Outdated


		PySpark est installé !

		## Création de notre pipeline ETL avec Apache Spark

Contributor

Cindyvlv Jul 5, 2024

Suggested change

-            ## Création de notre pipeline ETL avec Apache Spark
+            ## Etape 3 : Création de notre pipeline ETL avec Apache Spark

_articles/fr/2024-07-12-demarrer-apache-spark.md Outdated

+              - la transformer pour lui donner de la valeur
+              - stocker le résultat
+              ### Lecture de la donnée source

Contributor

Cindyvlv Jul 5, 2024

Suggested change

-            ### Lecture de la donnée source
+            ## Etape 4 : Lecture de la donnée source avec Spark

_articles/fr/2024-07-12-demarrer-apache-spark.md Outdated

+              lang: fr
+              date: '2024-07-12'
+              slug: demarrer-apache-spark
+              title: Démarrer avec Apache Spark

Contributor

Cindyvlv Jul 5, 2024

Suggested change

-            title: Démarrer avec Apache Spark
+            title: Démarrer avec Apache Spark étapes par étapes

Cindyvlv reviewed

_articles/fr/2024-07-12-demarrer-apache-spark.md Outdated


		Bravo, vous venez de créer votre premier pipeline Spark. Un nouveau monde s'ouvre à vous. A travers cet article, nous avons vu l'installation de Spark et PySpark. Avec la création du pipeline, nous avons lu la source de données, effectuées quelques transformation, et enfin stocké la données à un endroit. Ce stockage permettra à d'autre corps de métier de la data de l'exploiter.

		## Références

Contributor

Cindyvlv Jul 5, 2024

Suggested change

-            ## Références
+            ### Références

_articles/fr/2024-07-12-demarrer-apache-spark.md Outdated


		Ainsi, dans l'arboresence, nous avons nos données partitionné par date.

		## Conclusion

Contributor

Cindyvlv Jul 5, 2024

Suggested change

-            ## Conclusion
+            ## Conclusion


          Different fixes

ac835a8

ElevenTom had a problem deploying to 1131/merge

July 5, 2024 13:34

— with

GitHub Actions Failure

ElevenMarianne reviewed

_articles/fr/2024-07-12-demarrer-apache-spark.md Outdated

Comment on lines 13 to 17

+              keywords: [
+              - apache spark
+              - data
+              - big data
+                ]

Contributor

ElevenMarianne Jul 5, 2024

Suggested change

-            keywords: [
-            - apache spark
-            - data
-            - big data
-              ]
+            keywords:
+            - apache spark
+            - data
+            - big data
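
For reference, YAML accepts two list syntaxes for a front-matter field such as `keywords`, sketched below. This fragment is illustrative only and is not taken from the article file; the key names `keywords_flow` and `keywords_block` are hypothetical, chosen to show both forms side by side.

```yaml
# Flow style: brackets, comma-separated items, no leading dashes.
keywords_flow: [apache spark, data, big data]

# Block style: no brackets, one dash-prefixed item per line.
keywords_block:
  - apache spark
  - data
  - big data
```

Dashes inside brackets mix the two styles, and YAML parsers generally reject that, so dropping the brackets (as suggested here) leaves a valid block-style list.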


          suppression crochets

8b3fd0a

ElevenTom temporarily deployed to 1131/merge

July 8, 2024 08:23

— with

GitHub Actions Inactive

github-actions bot temporarily deployed to feat-demarrer-apache-spark

July 8, 2024 08:27

Destroyed


          Update and rename 2024-07-12-demarrer-apache-spark.md to 2024-07-08-demarrer-apache-spark.md

17f47b5

lepiaf deployed to 1131/merge

July 8, 2024 12:32

— with

GitHub Actions Active

github-actions bot temporarily deployed to feat-demarrer-apache-spark

July 8, 2024 12:35

Destroyed

ElevenTom approved these changes

ElevenTom merged commit 006d443 into master

8 checks passed

ElevenTom deleted the feat-demarrer-apache-spark branch

July 8, 2024 14:17

Labels

publication status/reviewable