VIVO AS A LINKED OPEN DATA ENABLER FOR THE UNIVERSITÉ DU QUÉBEC NETWORK

Abstract

Exposing interoperable Linked Open Data (LOD) in RDF notation is one of the six main use cases for the semantic web. Semantic web technology is the foundation of VIVO, and as such each VIVO installation can act as a source for LOD. However, the potential of LOD in VIVO remains relatively unexploited. The Université du Québec (UQ) is a network of 10 institutions throughout Québec, with over 102,000 students in some 1,300 programs at the undergraduate and graduate levels. This presentation will cover the following: (1) a brief overview of the LOD needs in the UQ network and how these may be met with solutions based on VIVO; (2) VIVO functionalities that can be exploited in the context of LOD; (3) the integration, reuse and design of standardized open vocabularies contained in the Linked Open Vocabularies (LOV); (4) the design and integration of a competency vocabulary; (5) a description of UQ's technological architecture, and specifically at the Université du Québec à Montréal (UQAM), the largest institution in the UQ network.
Michel Héon, Université du Québec à Montréal
Nicolas Dickner, Université du Québec à Montréal
Alexander Jerabek, Université du Québec à Montréal
Rachid Belkouch, Université du Québec à Montréal
VIVO Conference 2020

Welcome to this session on the use of VIVO in the context of Linked Open Data, and on how it is planned to be used in the Université du Québec network and at the Université du Québec à Montréal.

The Context
Université du Québec Network Overview
The Université du Québec (UQ) is a:
Network of 10 institutions throughout Québec
Distributed in 58 cities
With over 102,000 students
In some 1,300 programs at the undergraduate and graduate levels
7,200 professors and lecturers
Université du Québec à Montréal (UQAM) is an important member of the UQ network.
UQ and UQAM wish to use VIVO as a platform to distribute the expertise profile of their academic staff.

Let's first talk about the institutional context at the origin of the project. The Université du Québec is a network of ten institutions spread across fifty-eight cities in the province of Quebec.
It is attended by one hundred and two thousand students, who have a choice of one thousand three hundred undergraduate and graduate programs taught by nearly seven thousand two hundred professors and lecturers.
The Université du Québec à Montréal, a member of the UQ network, is also an important partner in this project.

Presentation Objective
Presenting the vision of VIVO's use as a Linked Open Data node for the Université du Québec network
This presentation will cover the following:
1. a brief overview of the LOD needs in the UQ network and how these may be met with solutions based on VIVO;
2. VIVO functionalities that can be exploited in the context of LOD;
3. the integration, reuse and design of standardized open vocabularies contained in the Linked Open Vocabularies (LOV);
4. the design and integration of a competency vocabulary;
5. a description of UQ's technological architecture, and specifically at the Université du Québec à Montréal (UQAM).

The first objective of this lecture is to present a vision in which VIVO would be a node in the LOD, broadcasting data related to the scientific expertise held by UQAM and UQ research staff.
The presentation will cover the following points:
First, an overview of the LOD needs in the UQ network and how they can be met by VIVO-based solutions.
Second, the functionalities of VIVO and how they can be exploited in the LOD context.
Third, how to use VIVO as a standard vocabulary provider, and how these standard vocabularies can be part of the LOD.
Fourth, the principles of skill vocabulary design and integration.
Finally, a proof of concept of the technology architecture that demonstrates the vision.

What Is Linked Open Data (LOD)?
Use case of the semantic web
Project to build a web of data
Entry point for vocabulary reuse and data extensibility
1,255 datasets worldwide, open and accessible through a SPARQL endpoint
A high value-added functionality of research data interconnection for the VIVO community

What is Linked Open Data?
Linked Data is one of the five use cases of the semantic web. LOD is its version enhanced by the worldwide opening of data.
It is the foundation of what Tim Berners-Lee calls the "web of data", this large global network that considers the web as an interoperable database of universal and interconnected data.
It is also a point of access to standardized, reusable and interoperable vocabularies.
The cloud presented below is the current representation of the LOD. Today, it contains nearly one thousand two hundred and fifty-five datasets distributed around the world, covering fields as varied as medicine, business, culture and science in general.
The semantic web is the only technology that is free of access rights and free of charge and that facilitates the sharing and linking of open data.
Let's note in passing that the research data published by VIVO is compatible with the LOD.
Finally, VIVO in the LOD is a high value-added research data interconnection feature for the VIVO community.

VIVO as a Semantic Web Server
Semantic web server characteristics that VIVO includes in its architecture:
SPARQL endpoint
RDF triple store
HTTP(S) communication protocol
RDF(S) data notation
IRI (URI/URL) resource addressing
Ready for Linked Open Data

Why is VIVO a semantic web server?
First of all, it stores its data in an RDF triple store that it makes accessible through a query web service called a SPARQL endpoint.
To share this data, VIVO uses web technologies such as the HTTP protocol, transmitting its data as RDF and using IRIs as the addressing mechanism for web resources.
In short, VIVO has all the necessary features to be integrated into the LOD.
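
To make the combination of SPARQL endpoint, HTTP and IRIs concrete, here is a minimal sketch of how a client might prepare a query for such an endpoint. It follows the generic SPARQL protocol (a `query` parameter on an HTTP GET); the endpoint address is a made-up placeholder, and a real VIVO instance may additionally require credentials. The request is only built, not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Hypothetical endpoint address (assumption); a real VIVO publishes its own.
ENDPOINT = "http://vivo.example.org/api/sparqlQuery"


def build_sparql_request(endpoint: str, query: str) -> Request:
    """Prepare (but do not send) an HTTP GET request carrying a SPARQL query."""
    url = endpoint + "?" + urlencode({"query": query})
    # Ask for the standard SPARQL JSON results format.
    return Request(url, headers={"Accept": "application/sparql-results+json"})


# Any resource in the triple store is addressed by its IRI; this query
# simply lists a few triples.
query = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 5"
request = build_sparql_request(ENDPOINT, query)
print(request.full_url)
```

Passing `request` to `urllib.request.urlopen` would send it; that step is left out here because the endpoint is fictional.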

Linked Open Vocabularies (LOV): a 'lovely' solution for expertise vocabulary reuse at UQ
LOV is a gateway to reusable semantic vocabularies on the Web
For UQ, LOV is a solution for obtaining or publishing standardized expertise vocabularies across institutions
https://lov.linkeddata.es/dataset/lov/

An important use of the LOD is the diffusion of open standardized vocabularies.
The LOV project is in fact a data hub whose datasets are standardized, interoperable vocabularies expressed in RDF.
In the UQ project, the plan is to use a vocabulary standardized by the Canadian government, which will be converted to RDF and distributed in the LOD through LOV.
For UQ, LOV is an appropriate platform to broadcast the Canadian skills vocabulary.

Public URL: purl.org
A PURL is a persistent URL: it provides a permanent address to access a resource on the web.
For the UQ project, the PURL normalizes:
the URLs for accessing the servers on the web
the IRI domain of the datasets published by VIVO.
E.g.: http://purl.org/vivo-dev.uqam.ca/crdc to access the CRDC vocabulary

An important aspect of data dissemination in the LOD is the standardization of the URLs contained in the dataset.
To be processed by the web, a URL must point or redirect to a physical web server with a unique address on the web.
The purl.org project makes it possible to build public URLs that are redirected to a URL associated with a physical web server.
In the right section of the slide, several addresses of UQAM servers are redefined by URLs labeled with public content.
This practice makes it possible to dissociate the public value of the URL from the physical address where the data is stored.
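
The separation between public URL and physical address can be illustrated with a small lookup table. This is only a sketch of the idea, not of how purl.org is implemented: the two public prefixes come from the presentation, while the physical server addresses and the helper names are invented for the example.

```python
# Public (persistent) URL prefix -> physical server address.
# The addresses on the right are hypothetical placeholders.
PURL_TABLE = {
    "http://purl.org/vivo-dev.uqam.ca/crdc":
        "http://server-a.example.org:8080/crdc",
    "http://purl.org/vivo-dev.uqam.ca/expertise":
        "http://server-b.example.org:8080/expertise",
}


def resolve(public_url: str) -> str:
    """Return the physical address for a public URL, keeping any trailing path."""
    # Try the longest registered prefix first so nested PURLs resolve correctly.
    for prefix in sorted(PURL_TABLE, key=len, reverse=True):
        if public_url == prefix or public_url.startswith(prefix + "/"):
            return PURL_TABLE[prefix] + public_url[len(prefix):]
    raise KeyError(f"no PURL registered for {public_url}")


def move_server(prefix: str, new_target: str) -> None:
    """Point an existing public prefix at a new physical server."""
    PURL_TABLE[prefix] = new_target


before = resolve("http://purl.org/vivo-dev.uqam.ca/crdc/api/sparqlQuery")
move_server("http://purl.org/vivo-dev.uqam.ca/crdc",
            "http://server-c.example.org/crdc")
after = resolve("http://purl.org/vivo-dev.uqam.ca/crdc/api/sparqlQuery")
print(before)
print(after)
```

The public URL used by clients and stored in datasets never changes; only the table entry does when the data moves to another server.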

UQ & UQAM LOD Capability Map Explorer
UQ LOD Proof of Concept

At this stage of the presentation we will present a proof of concept called "UQ & UQAM LOD Capability Map Explorer".

POC's goal
Demonstrate that it is possible to publish on the web a Capability Map Explorer that federates expertise navigation between UQ network and UQAM researchers
POC - UQ & UQAM Capability Map Explorer

The purpose of this section is to demonstrate that it is possible to publish on the web a Capability Map Explorer that federates expertise navigation between UQ network and UQAM researchers.

UQ Network in the LOD with VIVO
The High-Level Vision

The diagram below presents the vision associated with the proof of concept.
To achieve the goal of the POC, the first step is to extract unlinked data from data sources of different formats and notations, and to transform it into RDF notation to be stored in a VIVO triple store.
In the presented example, three expert data sources are converted and stored in VIVO instances that sit on the institutions' intranets. The datasets are thus linked data, but not open data.
Some data come from UQAM data sources, others come from the UQ network, while others may come from other institutions.
As for the fourth data source, it comes from the expertise vocabulary standardized by the Canadian government and is directly disseminated in a VIVO that is in the LOD.
Even though it is not part of this proof of concept, the Canadian vocabulary could also be included in the LOD.
The last component of the proof of concept is the one in the right-hand column.
The Application Populator is in fact a federated SPARQL query that links data from the different local VIVOs, extracts the open data and aggregates it into a VIVO that will be the Federated LOD Application.

Canadian Research and Development Classification 2019 (CRDC Vocabulary)
https://www.sshrc-crsh.gc.ca/funding-financement/nfrf-fnfr/crdc-ccrd-eng.aspx

Let's talk about standardized vocabulary.
The Canadian government is proposing a standardized vocabulary called "Canadian Research and Development Classification" (CRDC).
The government agency makes this vocabulary openly available, but it is stored in an Excel file and is not usable as it is in the LOD.
A step that transforms the Excel file into an RDF dataset is therefore necessary in order to publish the vocabulary in the LOD.

Slide 11, comment BR5 (Belkouch, Rachid; 2020-06-15): It is important to say here that this vocabulary does not come out of nowhere. It is an adaptation of the Australian and New Zealand standard ANZSRC, which is itself considered one of the best classifications of research disciplines in the world.
https://en.wikipedia.org/wiki/Australian_and_New_Zealand_Standard_Research_Classification
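
The spreadsheet-to-RDF step might look roughly like the sketch below. It is not the authors' transformation: it assumes the Excel file has been exported to CSV, and the column names, the labels, the second code, the namespace and the choice to type each entry by its parent class are all illustrative assumptions. Only the identifiers RDF10201 and RDF1020101 and the bilingual labels are taken from the presentation.

```python
import csv
import io

# Assumed namespace; the vocabulary's real IRI domain may differ.
CRDC_NS = "http://purl.org/vivo-dev.uqam.ca/crdc#"

# Stand-in for a CSV export of the spreadsheet. Column names and labels are
# invented placeholders, and the code RDF1020102 is illustrative only.
SAMPLE_CSV = """code,parent,label_en,label_fr
RDF1020101,RDF10201,Example field,Domaine exemple
RDF1020102,RDF10201,Another example field,Autre domaine exemple
"""


def escape(text: str) -> str:
    """Escape backslashes and double quotes for a Turtle string literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def rows_to_turtle(csv_text: str) -> str:
    """Turn each spreadsheet row into one Turtle description."""
    out = [
        f"@prefix crdc: <{CRDC_NS}> .",
        "@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .",
        "",
    ]
    for row in csv.DictReader(io.StringIO(csv_text)):
        label_en = escape(row["label_en"])
        label_fr = escape(row["label_fr"])
        # One resource per row, typed by its parent class, with two labels.
        out.append(f"crdc:{row['code']} a crdc:{row['parent']} ;")
        out.append(f'    rdfs:label "{label_en}"@en-CA ,')
        out.append(f'        "{label_fr}"@fr-CA .')
    return "\n".join(out) + "\n"


turtle = rows_to_turtle(SAMPLE_CSV)
print(turtle)
```

The resulting text is a Turtle file that could then be loaded into a triple store.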

Data Transformation Process

Briefly, the process of transforming local data into open data is carried out in three major steps, which are represented here by ovals.
The first step is to extract the local data and replace the organization's own skills vocabulary with the standardized skills in the CRDC. Once transformed, the data is translated into Turtle notation, ready for processing in the next step.
Step two is to publish the Turtle files in the VIVO triple stores that sit on each organization's institutional intranet.
Step three is a SPARQL query that federates the data contained in the set of organizational triple stores, extracts the open data about skills and expertise, and pushes it into the UQ-UQAM Capability Map triple store, which then becomes a node that is part of the LOD.
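
As a rough illustration of the first step, the sketch below replaces an institution's own skill labels with CRDC identifiers and writes the result as Turtle. The mapping table, the expert IRI, the CRDC namespace and the second CRDC code are hypothetical, and linking an expert to a skill with `vivo:hasResearchArea` is an assumption about the modeling rather than something stated in the presentation.

```python
CRDC_NS = "http://purl.org/vivo-dev.uqam.ca/crdc#"  # assumed namespace
VIVO_NS = "http://vivoweb.org/ontology/core#"

# Hypothetical mapping from local skill labels (lowercase) to CRDC codes.
# RDF0000000 is a made-up code used only for illustration.
LOCAL_TO_CRDC = {
    "local skill a": "RDF1020101",
    "local skill b": "RDF0000000",
}


def normalize_skills(local_skills):
    """Split local labels into mapped CRDC codes and labels with no mapping."""
    mapped, unmapped = [], []
    for skill in local_skills:
        code = LOCAL_TO_CRDC.get(skill.strip().lower())
        if code is None:
            unmapped.append(skill)
        else:
            mapped.append(code)
    return mapped, unmapped


def expert_to_turtle(expert_iri, local_skills):
    """Return (Turtle text, unmapped labels) for one expert."""
    codes, unmapped = normalize_skills(local_skills)
    lines = [
        f"@prefix crdc: <{CRDC_NS}> .",
        f"@prefix vivo: <{VIVO_NS}> .",
        "",
    ]
    if codes:
        objects = " ,\n        ".join(f"crdc:{code}" for code in codes)
        lines.append(f"<{expert_iri}> vivo:hasResearchArea\n        {objects} .")
    return "\n".join(lines) + "\n", unmapped


ttl, missing = expert_to_turtle(
    "http://example.org/individual/n4503",  # hypothetical expert IRI
    ["Local skill A", "Local skill B", "something unmapped"],
)
print(ttl)
print("needs manual review:", missing)
```

Labels that have no CRDC equivalent are returned separately so that they can be reviewed by hand instead of being silently dropped.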

CRDC Vocabulary Dataset
1. UQAM-DEV ontology editor perspective: managed by TopBraid Composer FE
2. CRDC class taxonomy
3. CRDC property taxonomy
4. CRDC individuals of the RDF10201 class
5. CRDC resource form for crdc:RDF1020101

We will focus here on the presentation of the CRDC vocabulary.
To present the vocabulary we use UQAM-DEV, a development environment that allows us to develop VIVO components. Note that UQAM-DEV will be presented in more detail in a later presentation.
As a development tool, UQAM-DEV incorporates another tool called TopBraid Composer Free Edition, which is in fact an ontology editor.
The presentation (in point one) is divided into several views.
In point two, we see the taxonomy of the vocabulary that comes from the transformation of the Excel file offered by the CRDC, that is, the taxonomy of the expertise classes contained in the vocabulary.
View three presents the properties associated with the CRDC vocabulary, including the hasLevel property.
View four presents the set of individuals contained in the class selected in view two.
The selected individual is presented in the form in view five, where the different attributes associated with this vocabulary element can be seen.
One can see the expertise label, which is given in both Canadian French and Canadian English.

UQAM - Expert and Normalized Expertise with CRDC
1. Location of researcher reference in VIVO ontology
2. Description of UQAM expert
3. URI list of researcher's competencies

Let's see together how the data looks after the first transformation step.
In the first view we notice the taxonomy of three ontologies used to structure the data: FOAF, Open Biological and Biomedical Ontologies, and VIVO.
In this ontology, there are one hundred and forty-one individuals of type FacultyMember.
The second view is the form that presents individual n4503, who is in fact a UQAM expert.
There is a set of attributes and, more specifically, in point three, attributes that are related to the expert's field of research.
We note that they are standardized with the CRDC vocabulary.

Cross Local VIVO Federated SPARQL Update Query by SERVICE Call

prefix crdc-serv: <http://purl.org/vivo-dev.uqam.ca/crdc/api/sparqlQuery?email=OO&password=OO>
prefix uq-serv: <http://purl.org/vivo-dev.uqam.ca/expert-uq/api/sparqlQuery?email=OO&password=OO>
prefix uqam-serv: <http://purl.org/vivo-dev.uqam.ca/expert-uqam/api/sparqlQuery?email=OO&password=OO>
prefix expert-serv: <http://purl.org/vivo-dev.uqam.ca/expertise/crdc/api/sparqlQuery?email=OO&password=OO>
INSERT { GRAPH <http://vivo-i18n.dev.uqam.ca:8080/expertise/graph> {
    ?s ?p ?o .
    ?se ?pe ?oe .
    ?suq ?puq ?ouq .
    ?seuq ?peuq ?oeuq .
    ?scrdc ?pcrdc ?ocrdc .
}}
WHERE {{{
    SERVICE uqam-serv: {
        GRAPH <http://vitro.mannlib.cornell.edu/filegraph/abox/expert-uqam.ttl> {?s ?p ?o}
    } }
    UNION {
    SERVICE uqam-serv: {
        GRAPH <http://vitro.mannlib.cornell.edu/filegraph/abox/expertise-uqam.ttl> {?se ?pe ?oe}
    } }
    UNION {
    SERVICE uq-serv: {
        GRAPH <http://vitro.mannlib.cornell.edu/filegraph/abox/expertise-uq.ttl> {?suq ?puq ?ouq}
    } }
    UNION {
    SERVICE uq-serv: {
        GRAPH <http://vitro.mannlib.cornell.edu/filegraph/abox/expert-uq.ttl> {?seuq ?peuq ?oeuq}
    } }
    UNION {
    SERVICE crdc-serv: {
        GRAPH <http://vitro.mannlib.cornell.edu/filegraph/abox/crdc.ttl> {?scrdc ?pcrdc ?ocrdc}
    } }
}}

Here is the query that federates the data from the different institutional servers in order to populate the VIVO that will be in the LOD.
Without going into the details of this query, let's simply note that, using the SERVICE keyword, the query can delegate the search to remote web servers, so the result is a grouping of data coming from a cluster of data servers.
In this query we refer to the CRDC expertise vocabulary server and to the UQAM and UQ expertise ontology servers. The server with the namespace expert-serv is in fact the one that hosts the Update command, meaning that it updates the data in its own triple store.
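
Because every branch of the query has the same shape (one SERVICE call wrapping one GRAPH pattern), the query text can also be generated from a list of sources. The sketch below is one possible way to do that, not the authors' tooling: the function name and the numbered variable names are invented, while the endpoint, graph and target IRIs are copied from the query above, including its `OO` credential placeholders.

```python
ENDPOINTS = {
    "crdc-serv": "http://purl.org/vivo-dev.uqam.ca/crdc/api/sparqlQuery?email=OO&password=OO",
    "uq-serv": "http://purl.org/vivo-dev.uqam.ca/expert-uq/api/sparqlQuery?email=OO&password=OO",
    "uqam-serv": "http://purl.org/vivo-dev.uqam.ca/expert-uqam/api/sparqlQuery?email=OO&password=OO",
}

FILEGRAPH = "http://vitro.mannlib.cornell.edu/filegraph/abox/"

# (service prefix, named graph) pairs: one UNION branch per pair.
SOURCES = [
    ("uqam-serv", FILEGRAPH + "expert-uqam.ttl"),
    ("uqam-serv", FILEGRAPH + "expertise-uqam.ttl"),
    ("uq-serv", FILEGRAPH + "expertise-uq.ttl"),
    ("uq-serv", FILEGRAPH + "expert-uq.ttl"),
    ("crdc-serv", FILEGRAPH + "crdc.ttl"),
]

TARGET_GRAPH = "http://vivo-i18n.dev.uqam.ca:8080/expertise/graph"


def build_federated_insert(target_graph, endpoints, sources):
    """Build an INSERT ... WHERE update that copies triples from remote graphs."""
    prefixes = [f"PREFIX {name}: <{url}>" for name, url in endpoints.items()]
    template, branches = [], []
    for i, (service, graph) in enumerate(sources):
        # Each source gets its own variables so the branches stay independent.
        triple = f"?s{i} ?p{i} ?o{i}"
        template.append(f"    {triple} .")
        branches.append(
            f"  {{ SERVICE {service}: {{ GRAPH <{graph}> {{ {triple} }} }} }}"
        )
    body = "\n  UNION\n".join(branches)
    return "\n".join(
        prefixes
        + [f"INSERT {{ GRAPH <{target_graph}> {{"]
        + template
        + ["} }", "WHERE {", body, "}"]
    )


update = build_federated_insert(TARGET_GRAPH, ENDPOINTS, SOURCES)
print(update)
```

Adding another institution then amounts to adding one endpoint and one or two entries in `SOURCES`, rather than editing the query by hand.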

VIVO UQ-UQAM Capability Map Application
http://purl.org/vivo-dev.uqam.ca/expertise

This view shows the result of the VIVO UQ-UQAM Capability Map application.
It presents the expertise graph around the "ontology" expertise.
The application identifies three professionals with this expertise.
In the view on the right, there are two professionals who belong to the Université du Québec network and one professional who belongs to the Université du Québec à Montréal.
This is therefore an application that federates the expertise of professionals from several institutions whose data sources are distributed across several VIVOs, and that uses a standardized open vocabulary to identify expertise.

Conclusion
LOD is a cloud of RDF triples that makes data sets, vocabularies and ontologies accessible, and these can be cross-referenced to feed specific applications.
A VIVO instance in the semantic web can also be a node in the LOD cloud.
The domain data can be distributed across several VIVO instances and be federated by a SPARQL SERVICE query.
A VIVO instance on an institution's intranet is also a useful graph data source.