Operations | Monitoring | ITSM | DevOps | Cloud

Analytics

Tutorial: How to Connect Jupyter Notebooks to Ocean for Apache Spark

Jupyter Notebook is a web-based interactive computational environment for creating notebook documents. It supports programming languages – such as Python, Scala, R – and is largely used for data engineering, data analysis, machine learning, and further interactive, exploratory computing. Think of notebooks like a developer console or terminal, but with an intuitive UI that allows for efficient iteration, debugging or exploration.

How Big Data Is Helping Advertisers Solve Problems

Big data is transforming the relationship between companies and customers. Analyzing large amounts of data for marketing purposes is not new, but recent advancements in big data technology have given advertisers powerful new ways of understanding consumers’ behaviors, needs and preferences. Big data helps you optimize each customer’s demands and convert them into prospective purchasers.

Using InfluxDB as an IoT Edge Historian

InfluxDB is increasingly being used in IoT solutions to store data from connected devices. Now it can also be used on IoT edge gateways as a data historian to analyze, visualize and eventually transmit aggregated IoT data up to a centralized server. In this article we’re going to look at three simple ways you can connect an instance of InfluxDB on your IoT Edge device to another instance of InfluxDB in the cloud.

10 Best Snowflake Monitoring Tools (Updated 2022)

The Snowflake data cloud offers powerful data warehousing, analytics, and processing tools. The platform can handle many data workloads on one platform, helping organizations turn data into actionable insight across teams, departments, and regions. Snowflake’s architecture is also unique. Compute and storage are completely independent and both are highly elastic. Yet, Snowflake's per-second billing and highly elastic compute model demand frequent usage and cost monitoring.

Why Enterprises Need Self Service Analytics

Self service analytics are becoming increasingly popular and essential in this data-driven world. For many businesses, there is a growing need for their internal departments to access their data and business intelligence and harness its power themselves. Traditionally, business intelligence processes are the purview of IT teams and data specialists.

The Big Problem With Big Data

The 2022 Australian Open men’s singles final between Rafael Nadal and Daniil Medvedev was epic! I watched it in Australia, which meant I was up until 2 a.m. on a Sunday night. Monday morning’s dog walk was sub-joyful. Medvedev seemed unplayable, taking the first two sets comfortably. But Nadal is a legend for a reason. Point by point, he overran his rival to make history.

Start with Python and InfluxDB

Although time series data can be stored in a MySQL or PostgreSQL database, that’s not particularly efficient. If you want to store data that changes every minute (that’s more than half a million data points a year!) from potentially thousands of different sensors, servers, containers, or devices, you’re inevitably going to run into scalability issues. Querying or performing aggregation on this data also leads to performance issues when using relational databases.