Data unification via automated (!) data integration
Automateddata blending & data connectionsdata cleansing & deduplication& data enrichment
Questions? Contact us!
The engines that drive this process
Byusing‘eventualconnectivity’,alldata(structured,unstructuredandimageonlyfilesthatwillbeOCR-ed)frominternal andexternalsourceswillbecollectedbycrawlersandblendedautomaticallyviaaningeniousprocessbymeansofi.a.our mergingengine.Webuildconnecteddatawithoutaneedforhavingtoknowschemasupfront.Thedatablendingwillbe doneonthefly(duringthedataintegrationprocessitself)andrelationsbetweendatabeputinplaceautomatically.Graph technologyisattheheartofthisprocessbutsearchcluster,blobstore,relationanddistributedcachestoreareequally important for speed and overall functionality.Ourinferenceenginehelpsyoutoinferconnectionsoutofeventhedirtiestofdata.Toinferconnectionswilltakesome time, but provides better quality results.Ourweighteddecisionenginemakesdecisionsonlywhenitisstatisticallyconfidentthatadecisioniscorrect.Ifthe confidencelevelistoolow,wewaituntilmoredataisingestedandrevisitthisdecisionagain.Wecanshowyouwhy decisionsaretakenwhichalsoallowsourenginetolearnfromthedecisionstaken. Thisenginecontributestoconstantlyre-evaluating, updating and enriching your data. In fact, the more data that will be ingested, the higher the quality will become.Ourcleansingenginecleansesandnormalisesdata.Itwillcorrectspellingmistakes,andwillcorrectincorrectidentifiers such as emails phone numbers and addresses. For this the Smart Data Fabric uses i.a.:fuzzy merging of i.a names, companies and locationsnamed entity extraction for determining the statistical likelihood of matchesparse trees for understanding the context behind textexternal lookups for validating input.Thecleansingandformattingprocessisdoneautomatically.Withthisstep,yourdataispreparedoptimallyforfurtherdata processing the Smart Data Fabric does.Ourde-duplicationengineprovidesyouagenericwayofde-duplicatingabsolutelyanything,fromdocumentstotasks. TheSmartDataFabricconsolidatestheduplicatesandsimplyletyouknowaboutthedifferentlocationsofthesame documents.Ourreinforcementlearningallowsyouwithhumaninteractionandinput,tofurtherimprovethequalityofyourvaluable data.OnceyourdataflowsthroughtheSmartDataFabric,westreamsomequestionsthatneedtobeansweredsothe DataFabriccanlearne.g.regardingyourspecificproductnames.Withthis,ithelpstomake(future)decisionsonyour data.Ourprocessingengine(pipeline)isalargecombinationofprocessingstepstomakesenseofanytypeofdataandto cleanse and enrich it.Processesaresupportedbydashboardsandintuitiveinterfaces. Amongothertools,our18dataqualitymetricsallowyou toseethequalityofyourdatapermetric.Byadjustingthelevels,automatedtaskscanbeapprovedgroupwisebyyourdata engineersandyourdatastewardscansupportassignedtasks(yes/no-questions)aspartofthereinforcementlearning process.
Making unified data available: data streaming
IntheSmartDataFabric,allunifieddataisavailabletoyouasadatastream.TheSmartDataFabricusesgraph-based modellingandsupportsallusecases! Asmentionedabove,theSmartDataFabricutilisesfivedifferenttypesofdatabases allowingyoutomodelandprocessthedatayouneed.You“subscribe”toacertainsubsetofdata,andthatdatawillbe deliveredtotheapplicationorplatformyouuse.Newprocesseddatainyourenterprise,matchingthissubsetwillbe delivered near real-time. Every application will benefit from receiving “live” data and data that is increased in value.Similarfunctionalityissupportedby“keepmeintheloop”,whichallowsyoutoreceiveinformationnearreal-timeine.g. your mailbox, allowing you to act on this new and relevant information.TheSmartDataFabricunifiesdatainanautomatedmannerandcreateswiththisasoliddatafoundationfromwhichall dataisqueryable!Itcanstreamhigh-qualitydataforfurtherprocessing(analysis,datascience,BI, AI,innovationetc.).You havefullcontroloverhowyouwanttouseyourdata.TheSmartDataFabricsimply“returns”yourdatacleanerand enrichedinaflexibleway.Withthis,efficiencywillbeimprovedandtimefreeduptobespendonbusinessuse cases and better decisions can be taken!
Technically,alldatayouneedisstoredwithinyourorganisation.But,aslongasyourdataremainsscatteredinsilosacross multipledepartmentsandisn’tanalysed,itwillbeuseless.Unifyingdata,willdelivervaluetoyourorganisation.Unifieddata supports upstream data consumers like data scientists and analysts to run queries, to get all of the data they need.Unifyingdatafromacrosscomplexsystems,ishoweveroneofthehardesthurdlestotake.Manyenterpriseshaveover hundreds if not thousands of systems and using ETL is no go.
Unifying your data fully automated with the Smart Data Fabric
The Smart Data Fabric solves the most difficult challenge in data management:“How to unify data from across complex systems and data sources in an automated way?”Thefirststepistocollect(extract)data.Thisistheeasypart.Butjustcollectingdata,isn’tenough.Tomakedataunified, yourdataneedtobeconnected.Optimally,theresultshouldbe“goldenrecords”:trusteddatathatisaccurateandcorrect. Datathatyoucanrelyon.Toachievethis,theSmartDataFabriccreatesconnecteddataandimprovesthedataqualityby cleansingthedata,de-duplicatingandnormalisingthedataandcompletingemptyrecordsinanautomatedmanner.With thisuniqueandautomatedwayofdataintegration,itdoesn’tmatterifjustadozendatasourcesneedtobeintegrated or several thousand! Only the ingestion time will increase.
+31 (0) 252-225 466
Unified data
Unify your data with the Smart Data Fabric
Fullyautomated(!),theSmartDataFabriccollectsallofyourdataandputitintoonecentralplace,cleansit,deduplicates it, keeps your data updated and relevant constantly and makes it available to upstream consumers.
Unify data across your enterprise via automated data integration (even 1.000’s of sources)Make high-quality data available to everyone
Technically,alldatayouneedisstoredwithinyour organisation.But,aslongasyourdataremainsscatteredin silosacrossmultipledepartmentsandisn’tanalysed,itwillbe useless.Unifyingdata,willdelivervaluetoyourorganisation. Unifieddatasupportsupstreamdataconsumerslikedata scientistsandanalyststorunqueries,togetallofthedata they need.Unifyingdatafromacrosscomplexsystems,ishoweveroneof thehardesthurdlestotake.Manyenterpriseshaveover hundreds if not thousands of systems and using ETL is no go.
Unifying your data fully automated with the Smart
Data Fabric
TheSmartDataFabricsolvesthemostdifficultchallengein data management:“How to unify data from across complex systems and data sources in an automated way?”Thefirststepistocollect(extract)data.Thisistheeasypart. Butjustcollectingdata,isn’tenough.Tomakedataunified, yourdataneedtobeconnected.Optimally,theresultshould be“goldenrecords”:trusteddatathatisaccurateandcorrect. Datathatyoucanrelyon.Toachievethis,theSmartData Fabriccreatesconnecteddataandimprovesthedataquality bycleansingthedata,de-duplicatingandnormalisingthedata andcompletingemptyrecordsinanautomatedmanner.With thisuniqueandautomatedwayofdataintegration,itdoesn’t matterifjustadozendatasourcesneedtobeintegrated or several thousand! Only the ingestion time will increase.
Unify your data with the Smart Data
Fabric
Fullyautomated(!),theSmartDataFabriccollectsallofyour dataandputitintoonecentralplace,cleansit,deduplicatesit, keepsyourdataupdatedandrelevantconstantlyandmakesit available to upstream consumers.
Unify data across your enterprise via automated data integration (even 1.000’s of sources)Make high-quality data available to everyone
Byusing‘eventualconnectivity’,alldata(structured, unstructuredandimageonlyfilesthatwillbeOCR-ed)from internalandexternalsourceswillbecollectedbycrawlersand blendedautomaticallyviaaningeniousprocessbymeansof i.a.ourmergingengine.Webuildconnecteddatawithouta needforhavingtoknowschemasupfront.Thedatablending willbedoneonthefly(duringthedataintegrationprocess itself)andrelationsbetweendatabeputinplace automatically.Graphtechnologyisattheheartofthisprocess butsearchcluster,blobstore,relationanddistributedcache store are equally important for speed and overall functionality.Ourinferenceenginehelpsyoutoinferconnectionsoutof eventhedirtiestofdata.Toinferconnectionswilltakesome time, but provides better quality results.Ourweighteddecisionenginemakesdecisionsonlywhenit isstatisticallyconfidentthatadecisioniscorrect.Ifthe confidencelevelistoolow,wewaituntilmoredataisingested andrevisitthisdecisionagain.Wecanshowyouwhy decisionsaretakenwhichalsoallowsourenginetolearnfrom thedecisionstaken.Thisenginecontributestoconstantlyre-evaluating,updatingandenrichingyourdata.Infact,themore data that will be ingested, the higher the quality will become.Ourcleansingenginecleansesandnormalisesdata.Itwill correctspellingmistakes,andwillcorrectincorrectidentifiers suchasemailsphonenumbersandaddresses.Forthisthe Smart Data Fabric uses i.a.:fuzzy merging of i.a names, companies and locationsnamedentityextractionfordeterminingthestatistical likelihood of matchesparse trees for understanding the context behind textexternal lookups for validating input.Thecleansingandformattingprocessisdoneautomatically. Withthisstep,yourdataispreparedoptimallyforfurtherdata processing the Smart Data Fabric does.Ourde-duplicationengineprovidesyouagenericwayofde-duplicatingabsolutelyanything,fromdocumentstotasks.The SmartDataFabricconsolidatestheduplicatesandsimplylet you know about the different locations of the same documents.Ourreinforcementlearningallowsyouwithhuman interactionandinput,tofurtherimprovethequalityofyour valuabledata.OnceyourdataflowsthroughtheSmartData Fabric,westreamsomequestionsthatneedtobeanswered sotheDataFabriccanlearne.g.regardingyourspecific productnames.Withthis,ithelpstomake(future)decisions on your data.Ourprocessingengine(pipeline)isalargecombinationof processingstepstomakesenseofanytypeofdataandto cleanse and enrich it.Processesaresupportedbydashboardsandintuitive interfaces.Amongothertools,our18dataqualitymetricsallowyoutoseethequalityofyourdatapermetric.By adjustingthelevels,automatedtaskscanbeapproved groupwisebyyourdataengineersandyourdatastewardscan supportassignedtasks(yes/no-questions)aspartofthe reinforcement learning process.
Making unified data available: data streaming
IntheSmartDataFabric,allunifieddataisavailabletoyouas adatastream.TheSmartDataFabricusesgraph-based modellingandsupportsallusecases!Asmentionedabove, theSmartDataFabricutilisesfivedifferenttypesofdatabases allowingyoutomodelandprocessthedatayouneed.You “subscribe”toacertainsubsetofdata,andthatdatawillbe deliveredtotheapplicationorplatformyouuse.New processeddatainyourenterprise,matchingthissubsetwillbe deliverednearreal-time.Everyapplicationwillbenefitfrom receiving “live” data and data that is increased in value.Similarfunctionalityissupportedby“keepmeintheloop”, whichallowsyoutoreceiveinformationnearreal-timeine.g. yourmailbox,allowingyoutoactonthisnewandrelevant information.TheSmartDataFabricunifiesdatainanautomatedmanner andcreateswiththisasoliddatafoundationfromwhichall dataisqueryable!Itcanstreamhigh-qualitydataforfurther processing(analysis,datascience,BI, AI,innovationetc.). You havefullcontroloverhowyouwanttouseyourdata.The SmartDataFabricsimply“returns”yourdatacleanerand enrichedinaflexibleway.Withthis,efficiencywillbe improvedandtimefreeduptobespendonbusinessuse cases and better decisions can be taken!