DATA
Scooped by Mickael Ruau
onto DATA

Open data and formats

There is no doubt that open data holds an important place in today's digital ecosystem. In my view, it follows, to some extent, from the rise of open source.

CIGREF report: Enjeux business des données (the business stakes of data)

This CIGREF report, "Enjeux business des données" (the business stakes of data), not only offers "a data management methodology, with concrete examples of implementation and good practices, and an opening onto a data valorization approach", but is also complemented by a self-assessment tool for gauging a company's maturity in data management.

I Heart Logs: Event Data, Stream Processing & Data Integration

Jay Kreps, CEO of Confluent and co-creator of Apache Kafka, shows you how logs work in distributed systems and provides practical applications of these concepts in a variety of common use cases.

Download the free ebook and:

Learn how logs are used for programmatic access in databases and distributed systems
Understand why logs are at the heart of real-time stream processing
Learn how logs and stream-processing can form a backbone for data flow and real-time data processing
Discover solutions to the data integration problem when more diverse data needs to be accessed by more specialized systems

Essential Apache HBase - DZone - Refcardz

HBase Tutorial - Learn HBase quickly with this beginner's introduction to the Hadoop database: a distributed, scalable Big Data store for managing very large tables.

Getting Started with BIRT - DZone - Refcardz

Provides an overview of the BIRT 3.7 components, focusing on a few key capabilities of the BIRT Designer, BIRT Runtime APIs, and BIRT Web Viewer.

Practical Data Mining with Python - DZone - Refcardz

Covers the tools used in practical Data Mining for finding and describing structural patterns in data using Python.

Understanding Data Quality - DZone - Refcardz

Data is one of the single most important resources for an organization. It can be used to help your business run smoothly, implement new strategies, and more. This Refcard will show you the key places data derives from, characteristics of high-quality data, and the five phases of a data quality strategy that you can follow.

Introduction to Text Mining - DZone - Refcardz

Thanks to text mining, you can extract information from written text. This is something we do, naturally, every day, in conversations or when we read. Like driving a car, once we learn how to do it, we take it for granted. This Refcard will introduce text mining, as well as key methods and techniques for success.

GAIA-X Catalogue search engine – under the hood

In today’s world, and now even more so with the recent quarantine measures, we rely on cloud services for running our infrastructures and storing our data. The GAIA-X initiative is born from a need to raise data sovereignty awareness and create a federated trustworthy cloud ecosystem for Europe. In this article, we will discuss how …

Things I Wished More Developers Knew About Databases

A large majority of computer systems have some state and are likely to depend on a storage system. My knowledge on databases accumulated over time, but along the way our design mistakes caused data…

OVH News - Innovation: dedicated servers connected at 40 Gbps, Hadoop and OpenStack Swift

News and technical expertise from the European cloud leader. Technical posts and in-depth articles on the digital revolution.
Mickael Ruau's insight:

Some OVH customers have set up Hadoop and OpenStack Swift clusters, with data volumes that can exceed 100 PB and, very often, in multi-datacenter configurations. This is where the need for 40 Gbps is felt most. These horizontal networks, including those between datacenters, are therefore forced to evolve by mixing 40 and 100 Gbps intelligently, that is, according to the latency separating the servers.

One of the first customers to use these 40 Gbps servers is hubiC, OVH's cloud storage service.


Dremio is the data lake engine - Dremio

Get more value from your data, faster. Dremio makes your data engineers more productive, and your data consumers more self-sufficient.

Integration of Apache Hadoop With OpenStack Swift

The topic of integrating Apache Hadoop with OpenStack Swift is not exactly new. You can follow our guide, written specifically for handling OpenStack.

Find Open Datasets and Machine Learning Projects | Kaggle

Download Open Datasets on 1000s of Projects + Share Projects on One Platform. Explore Popular Topics Like Government, Sports, Medicine, Fintech, Food, More. Flexible Data Ingestion.

JupyterCon2020

JupyterCon brings together data scientists, business analysts, researchers, educators, developers, core Project contributors, and tool creators for in-depth training, insightful keynotes, and practical talks exploring the Project Jupyter platform.

Data cleaning and preparation, the number-one chore of data scientists - Le Monde Informatique


One of the report's conclusions should surprise no one: Python remains the most widely used language in data science. R comes second, while JavaScript, Java, C/C++, and C# are far behind. Julia, whose popularity is nonetheless growing in the data science world, has not yet managed to place in the race. It is unclear, however, whether that is because respondents did not cite it often enough or because the survey did not mention it.

Database Partitioning with MySQL - DZone - Refcardz

Provides an overview of the MySQL database partitioning techniques and how these techniques lead to operational excellence.

Machine Learning - DZone - Refcardz

Covers machine learning for predictive analytics, explains setting up training and testing data, and offers machine learning model snippets.

Understanding Stream Processing - DZone - Refcardz

Stream processing is incredibly useful for processing big data volumes and providing useful insights in real-time. This is especially important when the value of the information in the data decreases as it gets older. This Refcard covers the building blocks of a stream processing solution, use cases that benefit from stream processing, and hands-on examples using Hazelcast Jet.

Data Warehousing - DZone - Refcardz

As a total architecture, data warehousing provides decision-support data that is consistent, integrated, standard, and simply understood. From descriptions to diagrams and integration patterns, this newly updated Refcard walks you through each aspect of data warehousing. Gain a complete understanding of data modeling, infrastructure, relationships, attributes, and speedy history loading and recording with atomic data.

Deep learning vs machine learning: what is the difference? - IONOS

Deep learning vs machine learning: two methods for equipping a computer with artificial intelligence. Learn more about their differences and their fields of application.

sqlancer/sqlancer: Detecting Logic Bugs in DBMS

Detecting Logic Bugs in DBMS. Contribute to sqlancer/sqlancer development by creating an account on GitHub.

Free Elastic Training

We're releasing free and open on-demand courses over the next few weeks. We know social distancing isn't fun, but it can be a great opportunity to learn new things. So while other people are making a second pass through their Netflix queue, you can build your Elastic Stack, observability, and security skills and come out the other side an expert.


Hadoop OpenStack Support: Swift Object Store

OpenStack is an open source cloud infrastructure which can be accessed from multiple public IaaS providers, and deployed privately. It offers infrastructure services such as VM hosting (Nova), authentication (Keystone) and storage of binary objects (Swift).

This module enables Apache Hadoop applications, including MapReduce jobs, to read and write data to and from instances of the OpenStack Swift object store.


Performance Analysis and Troubleshooting Methodologies for Databases

Slides from Peter Zaitsev's talk at the Big Data / Data Science Meetup in Montpellier, France, venue sponsor Elium Tech