Select Page

The Challenges of Connecting Multiple Data Sources in Akeneo PIM

By George Dzuricsko

December 3, 2021

With all PIM Implementation there is an ideal scenario. It’s simple and it follows the plan, but let’s be realistic: the complexity changes based on the details and our world is less than perfect. These challenges often add steps and options but they aren’t anything that can’t be handled with Akeneo and our expertise.

Data Sources for a “Vanilla” PIM

Traditional PIM data flows often contain 3 major systems:

ERP
PIM
E-Commerce Site

In these ‘Vanilla’ pim implementations, data governance is relatively straight-forward. Products are born in ERP, that data flows into PIM where attributes are marked as read-only and often grouped in an ERP attribute group, and then when a defined number of PIM required attributes are enriched the complete product flows downstream to a single destination.

The most common level of complexity that shows up for clients is multiple channels of data, which results in the creation of localizable and scopable attributes allowing for a single attribute to have multiple values. Added complexity in data usually leads to increased complexity in downstream exports, but that filtering is straightforward with proper pim configuration.

Multiple sources of data flowing into the PIM can lead to headaches, lost data, and worst of all, poor data quality and low trust in the system. Most of these pain points can be avoided by establishing clear ownership of data on an attribute group specific level.

For example, let’s say you have a PIM with four sources of product data:

ERP (includes Name, SKU, ManufacturerID, Manufacturer, ERP Description)
Translations Service (Own Spanish locale for all localized attributed in marketing team’s attributes ES_US)
Marketing Team (Long Description Product Specific Attributes including color, screen size, hard drive size, memory in English Locale only EN_US in all product categories)
DAM (product images, main image, manuals)

The four data sources (two teams and two automated imports from a DAM and an ERP) will have zero data collisions. In other words, because the value for the attribute can only be filled by one of the sources, there is never a question of the origin of the data or who should be tasked with cleaning it.

Introducing Greater Complexity to the Data Model

In an effort to improve data quality quickly and assist the marketing team, what happens if we add a subscription to pre-enriched product data? Let’s add a new feed, this time coming form 1WS for a subset of their products in the computer space in United State English locale only. What would that do to the data ownership?

ERP: No change
Translations Service: No change*
Marketing: Significant change

Both the Marketing Team and 1WS would be updating these values

DAM: No Change

There are a couple of ways to handle the potential collisions of the Marketing Attributes, but it is worthwhile to pause here to acknowledge that this scenario is already outside of a perfect world. Splitting responsibilities by category, locale, attribute group, or any of the other options in Akeneo is a cleaner way to handle this challenge. But reality is far from perfect, so my recommendation would be to take one of the following approaches:

Prefer human data to enriched
1. This puts some extra work on the integration team pushing data from 1WS into Akeneo, but it would first check to see if an attribute had an existing value. If there was a value, the product value for that attribute would not be added, to ensure human enriched values are preferred to an import.
2. The trade-off here is if 1WS data is more recent and/or better, it would be lost. Furthermore, because it was never written, there would not be a record in the history in Akeneo on the individual product.
Prefer latest update
1. Regardless of 1WS or human enriched, the latest data flows out. This would treat a 1WS update as any other user in the system.
2. The trade-off here is a user could have their content work overwritten. This may not seem like a big deal, but on a large scale basis preferring an automatic import over human enrichment can lead to distrust in the system and less clean data. AI and other teams are fully capable of having quality data, but I would prefer human to automated in most scenarios. That said, the Akeneo history would be available here for a user to audit on a single product basis to see where the higher effort lies.
Automate initial load of values, then lock out connector
1. This is similar to option 1, but it would allow for an initial write of product values on a product create, however, once the product data was written once successfully, it would not allow for additional writes.
2. The trade-off is you would get the benefit of only the initial data in 1ws. If there was a large scale update of product values of a product after that initial write, that data would not be saved. A user could be confident that if there was 1WS data in place, their manual enrichment would be protected.
Create attribute overrides:
1. Every attribute in this scenario could have a 1WS owned attribute and a corresponding override attribute that would only be human owned.
2. Trade-off: Massive duplication effort and if there was a poorly enriched human attribute, it would override the 1WS value every time.

With multiple avenues available to handle possible Market Attribute collision, this imperfect scenario can be resolved. No solution is without tradeoffs, and we work to ensure that our clients have all of the information required to handle these complex scenarios.

← Prev: Road Ready Wheels Partners with Sitation as Salsify Product Experience Management Implementation Partner Next: Akeneo - A Heuristic and Statistical Approach to Product Attribution using Legacy Data →

Case Study: Giant Tiger

Jul 11, 2024

When we first engaged with Giant Tiger, we discovered their efficiency was being hindered by siloed teams, manual...

Akeneo Serenity New Features for 2024

Jun 18, 2024

As Akeneo continues to roll out new platform upgrades, this spring’s release reflects the powerful tool’s focus on...

Sitation Wins Double Honors at Akeneo Unlock Boston 2024

Mar 12, 2024

We are thrilled to share that Sitation, a leading System Integrator in PIM/PXM and expert in digital merchandising,...

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.

Necessary

Always Enabled

Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.