View profile

DataScan: Issue #68

Revue
 
Can deep learning with synthetic data level the playing field? What is interenterprise data sharing?
 

DataScan

May 15 · Issue #68 · View online
Curated digest on the world of data.

Can deep learning with synthetic data level the playing field? What is interenterprise data sharing? Will GDPR will pop the adtech bubble? How has Google dodged the data privacy issue? 
🚀  Forwarded from a friend? Join my data digest for free.

Deep learning with synthetic data will democratise the tech industry. Evan Nisselson, partner at LDV Capital, explains how synthetic data can be used by smaller companies to compete with the major tech giants and their immense datasets. 💯 This fake but realistic data can be used to advance machine learning and help algorithms learn faster: 
Now, this advantage is being disrupted by the ability for anyone to create and leverage synthetic data to train computers across many use cases, including retail, robotics, autonomous vehicles, commerce and much more.

Synthetic data is computer-generated data that mimics real data; in other words, data that is created by a computer, not a human. Software algorithms can be designed to create realistic simulated, or “synthetic,” data.

This synthetic data then assists in teaching a computer how to react to certain situations or criteria, replacing real-world-captured training data. One of the most important aspects of real or synthetic data is to have accurate labels so computers can translate visual data to have meaning.
What is interenterprise data sharing? 🤔 Henrik Liliendahl explains the term, which had been referenced by Gartner on multiple occasions (including their recent research on the fundamentals for data integration initiatives), as:
In my eyes interenterprise data sharing is closely related to how you can achieve business benefits from taking part in the ecosystem flavour of a digital business platform.

Some of the data types where we will see such business ecosystem platform flourish will be around sharing product model master data and data about and coming from things related to the Internet of Things (IoT) theme.
GDPR will pop the adtech bubble. Great opinion piece by Harvard’s Doc Searls discussing the key differences between advertising and adtech, and how next week’s data legislation which affect the latter. ⛔
Searls raises the following “pro points” for consideration:
  • Don’t bet against Google
  • Do bet on any business working for customers rather than sellers
  • Do bet on developers building tools that give each of us scale in dealing with the world’s companies and governments
  • Do bet on publishers getting back to what worked since forever offline and hardly got a chance online: plain old brand advertising
How has Google dodged the data privacy issue? Larry Dignan points out how Google continues to give customers a return for sharing their data:
Whether it’s a helpful Google Assistant tidbit, unsolicited directions from Google Maps, a notification for your flight based on a Gmail entry and learning your screen habits over time, there’s a return on your data. Am I thrilled Google knows so much about me? Not really. Do I get value for sharing my information? You bet.
Are consumers as concerned about data sharing if it benefits them? 🔮
– On the other hand, Facebook has suspended 200 apps as part of investigation into data misuse and Apple is removing apps that share your location data with third parties.
Big data as the next public good? 🙌 Writing for the Washington Post, Bing Song explains why data should serve in the interest of society:
Treating data as a public good and regulating data aggregators as custodians of the public good will level the playing field for data access and exploitation, thus in the long run spurring sustained competition and innovation. This will not only counter monopolistic practices in different industries but also contribute to a more just distribution of wealth.
Miscellaneous
✅  What matters most during a data breach? How you react. Equifax has already spent $242.7 million on its data breach.
🏃  Fitbit and Google health data collaboration: What are the risks?
🎶  Spotify playlists in Bank of England’s sights to gauge consumer confidence.
🌟  How data is transforming the way astronomers make discoveries.
🎧  DataFramed - neat podcast presented by Hugo Bowne-Anderson.
📊  Great post by David Robinson on scientific debt.
🇮🇸  Video: Iceland could one day be the data capital of the world.
💥  Infographic: The importance of marketeers using data effectively and creatively.
💡  Data viz: Mapping how the United States generates its electricity:
Did you enjoy this issue?
If you don't want these updates anymore, please unsubscribe here.
If you were forwarded this newsletter and you like it, you can subscribe here.
Powered by Revue