decor
 

Planet Big Data logo

Planet Big Data is an aggregator of blogs about big data, Hadoop, and related topics. We include posts by bloggers worldwide. Email us to have your blog included.

 

December 08, 2018


Revolution Analytics

Because it's Saturday: Go Your Own Way

I was delivering a workshop for AI Live yesterday so I didn't get the chance to do my Friday post, but I'm here at SatRDays DC and the playlist on the audio while we're waiting for things to start is...

...
 

December 07, 2018


Forrester Blogs

“Crap” Content Continues To Describe B2B Marketing — Don’t Let It Describe Yours

In 2013, Doug Kessler and the crew at Velocity Partners published “Crap: the single biggest threat to B2B content marketing” — a work of thought leadership genius that I still tell marketers to read...

...
 

December 06, 2018


Forrester Blogs

There’s No “Data Strategy” — Align Insights Priorities To Your Business Strategy

Greetings from sunny but cold Orlando, where the inaugural Forrester Data Strategy & Insights Forum just wrapped up. During the event, I spent time with a seasoned data professional who joined me...

...
 

December 05, 2018


Revolution Analytics

Gender Diversity in the R and Python Communities

Many (if not most) tech communities have far more representation from men than from women (and even fewer from nonbinary folk). This is a shame, because everybody uses software, and these projects...

...

Forrester Blogs

Quantifying Vendor Efficacy Using The MITRE ATT&CK Evaluation

I’ve been extremely excited about the MITRE ATT&CK evaluation since it decided to open it up to vendors earlier this year. The endpoint detection and response (EDR) market represents the...

...

Forrester Blogs

The TV Industry Is Stumbling Toward Customer Centricity

I’ve just returned from a few days in the sun, having once again participated in Beet.TV’s annual Beet Retreat. (Wonderfully, we returned to Puerto Rico this year.*) The theme was “It’s...

...

Forrester Blogs

The Future Of Mobility Is Data, Not Cars

Having worked in and with the automotive industry for around 25 years, the challenges that OEMs face given their size and structures often inhibit the business agility needed to provide lasting...

...
 

December 04, 2018


Forrester Blogs

DAM Or Web CMS? Part 2: Find Out Which DX Technology You Need For Workflow And Delivery

In the second part of our series, we take a look at workflow and delivery and where digital asset management and web content management systems excel. If you haven’t seen the first part of our...

...

Forrester Blogs

Emphasize Emotion In Your Holiday Customer Service

This blog post is part of Forrester’s Holiday 2018 retail series. As the holidays approach and the post-holiday return rush quickly follows, a few things will occur simultaneously: Hundreds of...

...

Forrester Blogs

Competition Heats Up In Banking Transformation Services

After a few quiet years, Thought Machine, the fintech behind a new core banking platform called Vault, is stealing the limelight, first with its announcement of a strategic partnership with Lloyds...

...
 

December 03, 2018


Forrester Blogs

Do Cities Need A “Smart City Platform”?

Do cities need a “smart city platform”? It depends. Clients have been asking Forrester about our thoughts on new IoT-enabled smart city platforms launched by vendors focused on transforming city...

...

Forrester Blogs

Hello, Indian Brands: Your CX Report Is Here

Dear Brand, It’s that time of year again. As we do each year, we went out and asked your customers what they think about the experiences they had with you, and the results are in. Forrester’s The...

...
 

November 30, 2018


Revolution Analytics

Because it's Friday: If planets were as close as the moon

What would the sky look like if Mars, Jupiter, Saturn, or Neptune were as close to us as the Moon is now? Well, other than the global calamity caused by extreme tides and general astrophysical...

...

Forrester Blogs

Marriott Breach: Starwood Hacker Gains Access To 500 Million Customer Records

Another Friday, Another Breach Announcement Today, Marriott announced that it uncovered four-plus years of a previously unknown, unexpected, and unauthorized data breach that includes travel details,...

...

Revolution Analytics

Simulating dinosaur populations, with R

So it turns out that the 1990 Michael Crichton novel Jurassic Park is, indeed, a work of fiction. (Personal note: despite the snark to follow, the book is one of my all-time favorites — I clearly...

...

Forrester Blogs

HCOs Need To Double Down On Virtual Care And Interoperability In The Post-CVS–Aetna Era

Shifts in consumer lifestyle and expectations have given innovators the opportunity to enter the healthcare space and capture market share with their modern, enhanced patient experiences. Instead of...

...
InData Labs

The Most Exciting Uses of Image Recognition That are Already Changing Our Lives

Image recognition has been a topic on our blog a few time before. It’s a technology that has not stopped gaining popularity for some time now, and we wanted to take a look at what other interesting and even non-conventional ways image recognition makes a difference in different industries today. Image Recognition in Healthcare Do...

Запись The Most Exciting Uses of Image Recognition That are Already Changing Our Lives впервые появилась InData Labs.

 

November 29, 2018


Forrester Blogs

Artificial Intelligence Has A Probability Problem

AWS just announced Amazon SageMaker Ground Truth to help companies create training data sets for machine learning. This is a powerful new service for folks who have access to lots of data that hasn’t...

...

Forrester Blogs

Online Retail In Southeast Asia Is Expected To Reach $53 Billion By 2023

Southeast Asia (including Indonesia, Singapore, Malaysia, Thailand, the Philippines, and Vietnam) is home to 574 million people. Of these, 272 million (47%) are online and 144 million will make...

...

Forrester Blogs

Outlook For 2019: The State Of Retail Payments Report

“The State Of Retail Payments — Outlook For 2019” is a biennial study by the National Retail Federation (NRF) and Forrester that profiles US enterprise retailers’ views and decisions about payments....

...
Knoyd Blog

Data Enrichment: More data is often the easiest way

When it comes to Data Science, the most recurring topic is modeling. Quite a few articles out there talk about data preparation and only a bunch about how to communicate your results properly. However, there are hardly any dealing with the topic that we are going to cover today: data enrichment.

In our experience with helping companies to start using their data efficiently, in most (especially bigger) organisations the single lowest hanging fruit to go after is a context. Many times the organisations attempt to solve a particular problem with data and fail.

“There is not enough data.”

“The models are not good enough.”

“We cannot do anything with these results.”

…are outcomes a lot of folks are way too familiar with. Sometimes the simplest thing to do is to step back and ask yourself: “What does this data mean in the context of other data?”

Let’s talk about the two main kinds of data enrichment: Data integration and data augmentation.

Data Integration

We understand data integration as combining together all the data that each part of your business generates. This may seem like a no-brainer, but data silos per department are very common in larger organisations and getting them all together can sometimes be a non-trivial exercise. We can see this happening in smaller companies as well, especially since the rise of subscription based SaaS services, which resulted in different tools being used for different teams and therefore all the data being replicated and scattered all over the place.


Talk to people:

If there is more than one person in your organisation, the odds are someone else knows something you don’t ;-) The differences in knowledge and skill sets are at the core of any modern company, yet a lot of times people act as if this wasn’t the case. Because of legacy organisational structures, acquisitions or independent initiatives, teams often end up in information bubbles, unaware of all the valuable insights and data that their colleagues might be sitting on. Our advice? Before starting a project, try to think what information can be useful and who would have an incentive to collect it in your organisation. The odds are, they are collecting it.

Assume things are not the same:

Just because things are called the same or seem like they should be representing the same piece of data, does not mean this is the case. Matching data between different data sources is the most crucial part of the process and it can make or break your analysis. There are plenty of reasons why client_id  might mean something different in the variety of databases you have across different teams. Be it a legacy way of assigning an ID to a client, different products or as simple as using bigint vs. int (different data types) for your data in different databases. Tying back to the step above, find someone who can clarify things and make sure your assumptions hold true.

Stitch Data:

One of the great tools for data integration available out there is Stitch Data. This service allows you to backup/save your data from different sources into a data warehouse of your choice in a nicely structured format. Whether it is Google Sheets or .csv files, SaaS applications or some custom events that you are collecting. They handle consistency, failures and maintenance so you don’t have to. Great for teams, that are short on development resources.

Data augmentation

We define data augmentation as getting new data that is not generated by your business in order to give context to the data you already have. This can mean spending on betting pages for credit scoring in a bank, getting weather data for a car insurance company or social media accounts of a customer for an ecommerce site.

APIs:

Many applications and web services today provide access to their data through an API. This is a way for a developer to process data from a service in a programmatic way without the use of a graphical user interface. Nowadays, there are APIs for everything - weather data, maps, electric grid information, social networks, fitness apps, communications tools, emailing tools, government organisations… you get where I’m going with this. If you can think of some information to augment your internal data with, odds are there is an API that can help you out.

Fullcontact:

Fullcontact is the best API to augment data about companies and physical people. Searchable by email, Twitter handle or other personal info, it provides publicly available information about the individual, identified from all over the internet. Their language and location, social networks on which you can reach them or topics they are fans of. You can make sure that you are communicating with people about relevant things, in a proper language and within correct channels.*

Zapier:

We love this product. Zapier is the best way to augment data, especially if you don’t have many development resources to spare. It enables you to connect thousands of apps together through their APIs using a drag & drop interface. No need to worry about errors, maintenance, deployments or updates to new version. Zapier handles all of that for you.

Footnote:
*we realise this information can be (and is being) misused by some. We in no way encourage this and are strong advocates of using additional data to make technology better.

Get in touch
 

November 28, 2018


Forrester Blogs

Deep Dive Into Data Commercialization At Forrester’s Data Strategy & Insights 2018 Forum

I’m often asked the question, “What’s my data worth?” And my immediate (and somewhat provocative) response is “Nothing!” Data is, without a doubt, valuable. But when stored in vaults and locked down,...

...

Forrester Blogs

Healthcare In 2019: Five Bold Predictions

The healthcare industry is experiencing some much-needed transformation and disruption. Looking ahead to next year, we made five bold predictions. Watch the video below to learn what is in store for...

...

Revolution Analytics

R now supported in Azure SQL Database

Azure SQL Database, the database-as-a-service based on Microsoft SQL Server, now offers R integration. (The service is currently in preview; details on how to sign up for the preview are provided in...

...

Forrester Blogs

Evaluating Digital Experience Agencies In Asia Pacific — Providers Have Deep But Diverse Capabilities

Firms across Asia Pacific are actively leveraging digital experience (DX) agencies to design, build, and manage digital customer experiences (CX). But to fully access these benefits, you need to...

...
 

November 27, 2018


Forrester Blogs

Dear IBM And Red Hat Customers . . .

Since IBM’s announced intention to purchase Red Hat for $34 billion, Forrester analysts have received a number of inquiries about the potential acquisition. Outside of our previously published...

...

Forrester Blogs

Insights To Action — For Real This Time

You’d be forgiven for thinking it’s a sequel — a “Part Deux,” so to speak. But in fact, this is Forrester’s inaugural Data Strategy & Insights event. So why “this time”?...

...

Forrester Blogs

Advanced Analytics Is Required To Win, Serve, And Retain Healthcare Customers

There is an imminent market need to understand which analytics vendors truly help healthcare organizations (HCOs) make sense of their growing data assets and turn them into customer-level insights....

...

Forrester Blogs

Video: 2019 Research Themes (And A Few Resolutions)

Like most people, I consider November a good time to start thinking about the clean slate that a new year brings. There is lots of room to reflect on one year and get ready for the next one with a...

...

Forrester Blogs

Experience Points: Level Up With Experience Level Agreements

We habitually use metrics, even when they are not relevant. Our instincts are based on deeply ingrained responses to situations and inherent investment biases (time and cost). This annoys many a...

...
 

November 26, 2018


Revolution Analytics

AzureVM: managing virtual machines in Azure

This is the next article in my series on AzureR, a family of packages for working with Azure in R. I’ll give a short introduction on how to use AzureVM to manage Azure virtual machines, and in...

...

Forrester Blogs

2018 Marked 20 Years Of Customer Experience Research At Forrester!

I didn’t want to leave 2018 without noting a milestone for all of us here at Forrester who cover customer experience. It was 20 years ago, all the way back in September of 1998, when we inaugurated...

...

Forrester Blogs

An Omnichannel Black Friday

US consumers have turned their attention from consuming turkey, cranberry sauce, and pie to starting their holiday shopping both online and in stores. The National Retail Foundation (NRF) predicts...

...
 

November 23, 2018


Revolution Analytics

Because it's Friday: Pavarotti v Mercury

This is Canadian performer Marc Martel, performing both parts of this "duet" between Freddy Mercury and Luciano Pavarotti. (Stay through the credits for a surprise about how the video was made.)...

...

Simplified Analytics

How to get your employees glued up on the Digital Transformation

Digital Transformation allows your organization to adapt to a competing business environment and meet sustainable growth goals.  We can change our technologies, our infrastructure, and our...

...
 

November 22, 2018


Forrester Blogs

Channel Software Tech Stack (2019) — INFOGRAPHIC

(CLICK FOR HIGH RES) The channel technology stack is a group of technologies that brands leverage to manage and improve their indirect sales processes and partner programs. Often, the focus of...

...
 

November 21, 2018


Forrester Blogs

Nine More Questions For 3M Industrial Business Marketing Leader Penny Wise

This year’s B2B Marketing & Sales Forum wrapped up just under a month ago, but I wanted to continue the great conversation I started with Penny Wise, marketing director of the Industrial Business...

...

Revolution Analytics

AI, Machine Learning and Data Science Roundup: November 2018

A monthly roundup of news about Artificial Intelligence, Machine Learning and Data Science. This is an eclectic collection of interesting blog posts, software announcements and data applications from...

...

Forrester Blogs

Tier It Up: A Winning Strategy For Customer Success Management Programs

When my B2B clients on Forrester’s Customer Experience (CX) Council first start considering a customer success management (CSM) program to boost their retention and enrichment, I often hear that...

...

Forrester Blogs

Google’s Plan To Fix Healthcare With AI

For years, Alphabet (Google’s parent company) has let its businesses build out their own healthcare solutions without imposing an overarching healthcare strategy. As a result, the company now has no...

...
 

November 20, 2018


Forrester Blogs

Five Vendors Lead In Forrester’s Inaugural “The Forrester Wave™: Unified Endpoint Management, Q4 2018” Evaluation

Our recently released evaluation, “The Forrester Wave™: Unified Endpoint Management, Q4 2018,” uses 28 criteria to evaluate the top 12 unified endpoint management (UEM) solutions on the...

...
 

November 19, 2018


Revolution Analytics

AzureRMR: an R interface to Azure Resource Manager

In a previous article I announced AzureR, a new family of packages for working with Azure from R. This article goes into more detail on how you can use AzureRMR, the base package of the AzureR...

...

Revolution Analytics

Cognitive Services in Containers

I've posted several examples here of using Azure Cognitive Services for data science applications. You can upload an an image or video to the service and extract information about faces and emotions,...

...

Forrester Blogs

Is Your Data Strategy Ready To Keep Up?

I remember a few years ago when, as enterprise architects, we sat around in the office of the VP of architecture and planned our data strategy on the whiteboard. Replace that clunky warehouse with a...

...

Forrester Blogs

The Forrester Wave™: Managed Security Services Providers (MSSPs), Europe, Q4 2018

I published my first Forrester Wave™ today, covering the managed security services provider (MSSP) market in Europe. The culmination of four months of hard work by not just us but all the vendors...

...
 

November 18, 2018


Forrester Blogs

Will CX Pros Still Have A Job In 2025?

There we were . . . a round table of CX leaders from across Southeast Asia, senior executives with years of experience running large, successful teams and chipping away at the journey to turn our...

...
 

November 16, 2018


Forrester Blogs

What SAP Customers Should Know About The $8B Qualtrics Buy

SAP has been on an acquisition binge in recent years (SuccessFactors, Hybris, Ariba, Concur, Fieldglass, CallidusCloud, Gigya, and others) — a marked departure from SAP’s build-it-all strategy of the...

...

Revolution Analytics

Because it's Friday: The physics of The Expanse

For a science fiction show set hundreds of years in the future, The Expanse is unusual in that it takes very few liberties with Science as we understand it today. The solar system is made up of the...

...

Forrester Blogs

The Fight For Cybersecurity Brand Dominance Intensifies

“Everything Is An Endpoint” Brings BlackBerry Back From The Dead For many, the fact that BlackBerry still exists — and the fact that it spent $1.4 billion of the $2.4 billion in capital...

...
decor