DHQ: Digital Humanities Quarterly
Preview
2024
Volume 18 Number 3
Preview  |  XMLPDFPrint

Conceptual Modeling of European Silk Heritage with the SILKNOW Data Model and Extension

Abstract

Silk holds significant historical importance in European history, fostering economic growth, innovation in weaving techniques, and the creation of exceptional artifacts. Despite the conservation efforts of numerous institutions, silk heritage remains at risk due to its fragile nature. This paper outlines the methodology employed by the Horizon 2020 SilkNow project aimed at enhancing the promotion and preservation of silk-related cultural heritage collections. We present the development of a CIDOC CRM-based data model for the creation of a comprehensive knowledge graph. We also introduce the SilkNow extension, designed to encapsulate the intricate semantics associated with the production processes of silk fabrics. Our results demonstrate the potential of Semantic Web technologies in safeguarding and enriching the visibility of silk heritage through improved data interoperability and accessibility.

Introduction

For the past ten years, the history of fashion has attracted a growing public [Petrov 2019] [Stewart and Marcketti 2012]. But despite the interest shown in these collections, textile heritage remains under threat. This is particularly true of silk heritage, whose fragility poses major conservation problems and whose protection requires substantial investment. Few materials are of such historical, cultural and artistic importance; but a significant part of the silk-related collections are preserved by small institutions, lacking generally the means to promote their collections [Portalés et al 2018] [Gaitán et al 2019] [Pagán et al 2020]. Outside of the major events devoted to them, we also note the under-use of textile collections by visitors [Stewart and Marcketti 2012]. This is why the digitization of cultural heritage has become one of the crucial issues for its promotion [Eastermann 2015].
When users are interested in textile cultural heritage online, they are generally faced with two situations. The large generalist museums have the technical, financial and human resources to disseminate their data on the Web, but they do not systematically highlight their textile collections on their websites. Conversely, small museums do not have the same resources as their better-endowed counterparts [Claerr and Westeel 2010] to promote specialized collections online, which can be particularly rich. This is particularly the case for silk heritage, as the example of the Rhône-Alpes region clearly illustrates [Foron-Dauphin and Cano 2016] [Fournier et al 2016]. These institutions do describe their collections, but the digital data they produce is, most of the time, inaccessible online. Without a digital presence, these collections remain generally unknown, except to a few connoisseurs and specialists.
The Semantic Web offers particularly interesting prospects in this domain; it proposes to cultural heritage institutions to increase the visibility of their metadata on the Web by “freeing” it from catalogs; to aggregate this data with others and provide access to it in a federated way; to interconnect data with each other and, ultimately, to enrich this information more easily thanks to the links created between them. By encouraging the dissemination of structured data, Semantic Web technologies indeed facilitate the creation of links between dispersed historical sources [Meroño-Peñuela et al 2015]. The Semantic Web is not a new Web that replaces the current Web, but an extension of it [Berners-Lee, Hendler, and Lassila 2001], enabling navigation not between documents via hypertext links, but directly between data. This implies that all information is provided with a well-defined meaning, making it interpretable and reusable by data consumers, both human and machine, who will then be able to process it automatically and develop new applications and services [Bermès 2013]. Museums are particularly interested in the aggregation possibilities offered by Semantic Web technologies [Freire et al 2019], especially for heterogeneous metadata [Peroni, Tomasi, and Vitali 2013]. For example, the Europeana Fashion portal provides access to the collections of over thirty European institutions. However, it primarily aggregates data from major European museums only [Suls 2017], resulting in a notable exclusion of smaller museums. This approach neglects the promotion of more specialized collections, notably those focusing on silk. To address this deficiency, the H2020 SilkNow project[1] has developed ADASilk, an innovative exploratory search engine. This engine offers federated searches across diverse cultural heritage collections, with a specific emphasis on silk-related artifacts.
ADASilk is based on a knowledge graph that enables data from a variety of sources to be aggregated and presented online. A knowledge graph is a set of interconnected descriptions of entities - objects, people, events, concepts or situations - which can be enriched by a retrospective acquisition of information and integrated into ontologies [Gruber 1993]. This approach has proven particularly successful for other types of cultural heritage data: for example, bibliographic data describing musical works for the DOREMUS project [Achichi et al 2018], archival data for the ArchOnto project [Koch, Ribeiro, and Lopes 2020], or historical data for the WarSampo project [Hyvönen et al 2021]. The SilkNow Knowledge Graph represents the data contained in the catalogues of 21 cultural heritage institutions[2] , following a subset of the CIDOC Conceptual Reference Model or CIDOC-CRM. Collected online or provided directly by the institution concerned[3], the data describe objects that may be entirely made of silk, or incorporate other textile materials (natural fibers, metal threads), or be made of composite materials (e.g. wood in the case of furniture). The SilkNow project focused on objects produced or consumed in Europe between the mid-15th to the middle of the 19th century.
In this paper, we present the different steps that allowed us to build the SilkNow data model. We also describe the methodology we used to create a conceptual model extending the CIDOC CRM, and to integrate the annotations produced with machine learning methods. In a first part, we discuss the contributions of Semantic Web technologies for the protection and enrichment of silk heritage. In a second part, we explain the approach we adopted to develop the SilkNow data model. In the third part, we show how we used the Provenance Data Model or Prov DM to integrate the semantic annotations produced on these data thanks to the analysis of texts and images. We also propose to extend the CIDOC CRM with new classes and properties, so as to be able to express the complex semantics of the data describing the production process of silk fabrics.

Providing access to heterogeneous data using Semantic Web technologies

Heterogeneous metadata

The records produced by all these institutions generally provide the same types of information: type of object, place and date of production, materials and techniques used, dimensions, and illustration(s). But the way in which this metadata is structured can vary greatly depending on the practices of its producers. These data come from institutions using various cataloguing standards and respecting their own description standards. Figure 1 illustrates this problem by comparing the list of metadata describing two objects (here, two damasks[4]) held respectively by the Museu Tèxtil Terrassa [5] and by the Victoria & Albert Museum [6] . The same types of information are used to describe these two objects, but they are expressed differently by the two museums. For example, the Victoria & Albert Museum indicates in the same field the material and technique used, whereas the Museu Tèxtil Terrassa clearly separates this information.
side-by-side screenshots of museum webpages
Figure 1. 
Metadata describing the techniques and materials used: examples from the Museu Tèxtil Terrassa and the Victoria & Albert Museum
In other words, the Victoria & Albert Museum and the Museu Tèxtil Terrassa do not use the same data model. This is because the catalogues are designed as separate data silos, which are not supposed to communicate with each other. If one wants to collect this metadata on a large scale, it will therefore be difficult to process it automatically due to the heterogeneity of its structure, but also due to the lack of structure of the metadata. Different types of information are often found in the same field (as shown in Figure 1), which makes it difficult to automatically process the data. By nature, cultural heritage data are heterogeneous: they are produced by various institutions, they describe various objects - which does not make them easily accessible by search engines [Freire et al 2018]. To improve the discoverability of this data online, Semantic Web technologies offer the possibility of aggregating and providing access to metadata from different sources [Freire et al 2019]. This is in this context that the SilkNow project has created an exploratory search engine that enables a federated search of 21 collections dedicated to the conservation of European silk heritage.

An ontology to build a knowledge graph

Since the emergence of the Semantic Web, we have seen new changes in the use of the Web by museums, particularly the increasing openness of the data produced by these institutions [Marden et al 2013] [Eastermann 2015]. The Semantic Web implements linked data [Bizer, Heath, and Berners-Lee 2009], with the aim of enabling machines to understand the semantics of information published on the Web, and thus to create new, even unexpected, links between this information. The Semantic Web thus opens up new perspectives for the visibility of cultural heritage online, by making cultural heritage data available to all - and not only in a museum context -; by aggregating, publishing and enriching this data on the Web to facilitate its reuse; by linking this data to each other and to other data, so as to constitute an information network; and finally by making proposals for enriching this data.
SILKNOW hypothesized that the use of Semantic Web technologies, and more specifically the creation of a knowledge graph, could facilitate the integration, exploration and retrieval of data describing historical silk artefacts. By containing this data, the SilkNow Knowledge Graph is intended to create a single access point to this data. The implementation of the Semantic Web is based on distinct principles and standards, leading to what is known as the “Web of Data” or Linked Data [Bizer, Heath, and Berners-Lee 2009]. This framework is made possible by RDF, a graph model designed by the W3C to describe Web resources and their metadata. In RDF, each unit of information is represented by a three-element assertion (or triplet) comprising a subject, a predicate (or property) and an object. These elements are represented as URIs (Uniform Resource Identifiers), thus becoming “resources”. These resources are then organized into classes, conceptualized as collections of individuals or objects. The Web of Data doesn't establish fixed definitions for classes and properties; similarly, RDF's versatility doesn't dictate specific usage rules for classes and properties [Bermès 2013]. Consequently, the adoption of a knowledge representation technology becomes essential. Ontologies play a crucial role here, enabling the declaration of classes, properties, their behaviors and hierarchy in languages such as RDF Schema (RDFS) and Web Ontology Language (OWL). The selection of specific classes and properties from an ontology to represent the desired information leads to the creation of a data model [Bermès 2013].
An ontology is a mutually shared conceptual model used to represent all information in a specific application domain by means of defined hierarchical concepts [Gruber 1993]. Consequently, the use of ontologies helps to solve the problem of semantic heterogeneity arising from diverse data sources, and to improve the interoperability of systems and applications. With the expansion of the Semantic Web, ontologies have been developed for a variety of purposes [Wang, Xia, and Niu 2014]. In the cultural heritage domain, reference ontologies have been established to detail objects and their relationships. The CIDOC Conceptual Reference Model or CIDOC-CRM, used to describe objects in museums collections, is a striking example [Doerr 2003]. When a domain-specific ontology is available, it is possible to create more restricted subsets or specializations tailored to particular sub-domains [Messaoudi et al 2019] [Kergosien et al 2019].
In the framework of SilkNow, data is downloaded or collected from partner institutions; and these files are converted to create the knowledge graph using the Resource Description Framework as the data model. The graph, which contains all the metadata, is then loaded into a triple store. The converted data is then accessible online via ADASilk. The creation of the knowledge graph requires an RDF conversion, which is based on a manual mapping of the data. This mapping consists in putting in equivalence the data models describing similar objects [Christen 2012]. When working with two data sources A and B, data mapping can consist of matching each field in database A with those in database B. In the SilkNow project, we are using many different sources; we have chosen to use a pivotal data model that makes it easier to perform this mapping between all the collected datasets. We have thus created a subset of the CIDOC Conceptual Reference Model (CIDOC CRM) ontology, which is instantiated at the time of the RDF conversion [Schleider et al 2021b] [Schleider et al 2021c].

The SilkNow data model

Expressing the semantics of silk heritage information with the CIDOC CRM

We chose CIDOC CRM [Le Boeuf et al 2025] because it is a conceptual model specifically developed for cultural heritage by the International Committee for Documentation (CIDOC) of the International Council of Museum (ICOM). An ISO standard since 2006, renewed in 2014 and 2023[7], the CIDOC CRM is today a standard intended to ensure the interoperability of cultural heritage data. However, this standard is not set in stone; thus, at the time of writing this article, version 7.2.3 has been published[8] . However, we used version 6.2[9] - the most recent version at the time of the start of the project in 2018. CIDOC CRM is also a flexible and extensible model. If necessary, this model can be extended by creating new classes and properties to express new information, without changing the basic structure of the model. These features allow the creation of specialized extensions such as FRBRoo (Functional Requirements for Bibliographic Records), a conceptual model for bibliographic data[10], or CRMtex, a conceptual model for the study of ancient texts[11]. There are a total of eleven compatible models on the official CIDOC CRM website[12].
CIDOC CRM allows the underlying semantics of cultural heritage information to be expressed, and thus heterogeneous data to be modeled in a homogeneous way. If we take the descriptions of the two damask fabrics preserved by the Victoria & Albert Museum and the Museu Tèxtil Terrassa (Figure 1), we can express in a unique way with CIDOC CRM the information about the technique and the materials used during the weaving process. CIDOC CRM allows to model each of the descriptive fields with triplets Class (C) - Property (P) - Class (C), which can be represented as graphs as in Figure 2.
A graphical representation of an RDF graph showing how CIDOC classes (such as “E22 Man-Made Object” and
              “E38 Image”) are related through properties (such as “P138i has representation”) to construct full RDF
              statements in the form of triplets (for instance, “E22 Man-Made Object - P138i has representation - E38
              Image”)
Figure 2. 
Graph in accordance with the CIDOC CRM conceptual model
In this figure, we can see that we have chosen to use the E22_Man Made Object class to represent the object described by the museum metadata. In version 6.2 of CIDOC CRM, this class is indeed used to model “physical objects purposely created by human activity” [Le Boeuf et al 2025].

What classes and properties to represent this information?

As a conceptual model, CIDOC CRM is intended to model all kinds of cultural heritage data and should be able to cover most cases. CIDOC CRM thus provides a wide range of classes and properties. For example, there are 89 classes and 153 properties offered by version 6.2[13]. In general, therefore, we do not need to use all the classes and properties offered by CIDOC CRM.
To select the classes and properties needed to model the metadata collected by the project, it is first necessary to list the descriptive fields used to describe the textile objects. To do this, we analyzed and compared a large number of records, based on the documentation used by cultural heritage institutions to create this metadata (notably [Grant, Nieuwenhuis, and Petersen 1995]). This analysis allowed us to draw up a list of descriptive fields commonly used to describe textile objects, while eliminating descriptive fields that were not relevant to the project - namely, fields providing information on the administrative management of the object.
These descriptive fields were then classified into Information Groups, which form the Data Dictionary of the SilkNow project. These information groups allowed us to identify the categories used by cultural heritage institutions to describe ancient textiles. We defined 22 Information Groups, from the most commonly used (e.g. “Object Title Information Group” or “Object Measure Information Group”) to the most rarely encountered (e.g. “Object Missing Part Information Group” or “Exhibition Information Group”). These Information Groups then allowed us to rely on the Functional Overviews provided by the official CIDOC CRM documentation[14], and to more easily select the most appropriate classes and properties to represent these information groups. Functional Overviews classify CIDOC CRM classes and properties into information categories that can be easily aligned with the Information Groups in our data dictionary. For example, the Functional Overviews provide an “Object Title Information” category that corresponds to our “Object Title Information Group”.
The SilkNow data model consists of the classes and properties selected to represent the information groups we have previously defined. It is accessible and documented online [Puren and Vernus 2021b] thanks to OntoME [Beretta 2021], which is an ontology manager developed and maintained by the LARHRA laboratory. The SilkNow data model is mainly based on the CIDOC CRM Core version 6.2 (71 properties and 21 properties). We also selected two classes and one property from an extension of CIDOC CRM, the Scientific Observation Model (CRMsci) version 1.2.3[15] [Kritsotaki et al 2017]. The CRMsci, which is a conceptual model designed to integrate the metadata produced by scientific observations, seemed to us to be very appropriate for modeling the historical and technical analyses formulated by experts in the field on ancient textiles (Figure 3)[16]. We chose to use the S4_Observation class provided by the CRMsci extension because it is defined as “the activity of gaining scientific knowledge about particular states of physical reality gained by empirical evidence, experiments and by measurements” [Kritsotaki et al 2017]. The observations made by the experts on ancient textiles thus seemed to us to fall under this type of scientific activity.
A diagram showing a metadata record with a descriptive note, and an accompanying RDF representation of a
              portion of that record showing the use of the CIDOC class “S4 Observation” and property “P3 has note” to model
              that aspect of the metadata
Figure 3. 
Use of the S4 class Observation

Evaluating the ontology

The quality and robustness of this ontology is then assessed by a mapping process. This process consists of verifying that it is possible to represent all the metadata fields using the classes and properties selected in the ontology. The mapping is produced manually by domain experts, who then provide mapping rules for each of the collected datasets. Concretely, we chose two objects described extensively[17] in the catalogues of the selected institutions. Then, we interpreted each of the descriptive fields in the form of triplets Entity (E) - Relation (R) - Entity (E). To do this, we relied on the collected metadata files, converted into an intermediate JSON format. Example 1 shows an extract corresponding to the metadata contained in Table 1.

"fields": [
  {
    "label": "title",
    "value": "lé de tenture"
  },
  {
    "label": "Désignation",
    "value": "lé de tenture"
  },
  {
    "label": "N° d'inventaire",
    "value": "VMB 14527"
  },
  {
    "label": "Domaine",
    "value": "Textiles"
  },
  {
    "label": "Date de création",
    "value": "XVIIIe siècle"
  }
]
Example 1. 
Extract from a JSON metadata file
As shown in Table 1, we have therefore decomposed the whole metadata schema with Class (C) - Property (P) - Class (C) triples from the SilkNow data model.
Field Value Value Path
Désignation Lé de tenture E22_Man-Made Object P102_has title E35_Title
N° d'inventaire VMB 14527 E15_Identifier Assignment P140_assigned attribute to E22_Man-Made Object
E15_Identifier Assignment P37_assigned E42_Identifier
E15_Identifier Assignment P14_carried out by E40_Legal Body (Château de Versailles)
E42_Identifier P2_has type E55_Type (N° d’inventaire)
Date de création XVIIIe siècle E12_Production P4_has time-span E52_Time-Span
E52_Time-Span P78_is identified by E49_Time Appellation
Table 1. 
Mapping rules in the SilkNow data model
We then represented these triples as graphs[18] as in Figure 4. These graphs allow us to visualize the triplets and to check their consistency more easily.
A graphical representation of RDF triplets modeling the metadata record shown in Figure 3
Figure 4. 
Triplets (extracts) used to model the metadata of the record shown in Figure 3
After providing these mapping rules, we found that all the metadata fields could be expressed using the SilkNow data model. The JSON files were then converted to create the knowledge graph using an RDF converter. The graph, which contains the metadata describing 36,210 objects illustrated by 74,527 images, is finally loaded into a triple store, while the images are uploaded separately to a media server. All data is available via a faceted browser (https://data.silknow.org/fct/) and a SPARQL endpoint (https://data.silknow.org/sparql). This browser can also be used via a graphical user interface which provides access to the SilkNow multilingual thesaurus dedicated to the vocabulary of historical silk textiles[19] (see “The SilkNow extension”). All the data collected is accessible with the exploratory search engine ADASilk (Advanced Data Analysis for Silk Heritage [20]) [Schleider et al 2021b]. Figure 5 shows the record describing the tapestry preserved in Versailles, as it appears in ADASilk[21].
screenshot of ADASilk web-interface with image and metadata
Figure 5. 
Example of a record in ADASilk

SILKNOW's contribution to textile heritage: the example of religious fabrics

The benefits of using this type of technology are particularly apparent when it comes to religious textiles. Religious textile collections are preserved in a wide variety of institutions. While collections of “ordinary” textiles are generally preserved in fashion museums, they can also be found in ethnological or decorative art museums. Religious textiles are preserved not only in national heritage institutions, but also in small and medium-sized ecclesiastical museums. In addition, churches, cathedrals and convents may use these fabrics as part of their liturgy and daily devotions. There are also collections preserved by companies that produced these fabrics: such is the case of the Garín archives - one of the SilkNow consortium's partners - which conserve very rich samples of religious fabrics representative of Spanish silk manufacturing in the 18th and 19th centuries [Alba et al 2020]. The integration of the inventories produced by these structures into the SilkNow Knowledge Graph enables them to be displayed via ADASilk, thus improving the accessibility and visibility of their collections.
In some cases, small heritage structures do not have a catalog or even an inventory, which makes it extremely difficult to determine the origin of the fabrics kept there. These fabrics, widely dispersed across Europe, are even incomplete[22]. As a result, information is fragmented and disconnected, making it difficult in many cases to correctly identify and date these artifacts, and hence to catalogue them. ADASilk enables smaller organizations, particularly church museums and religious institutions, to trace the links between the objects they conserve and other artifacts held in distant collections, and to integrate them into a renewed chronology. This can help create new synergies for ecclesiastical collections, cathedrals and parishes that are custodians of this heritage [Alba et al 2020].
Small parishes and diocesan museums often lack the resources — both financial and human — to properly conserve these works. This information is also indispensable when these establishments want to restore the fabrics they conserve, as they rarely have experts capable of carrying out the appropriate research. This results in a lack of appropriate methodology and restoration that should stem from a thorough knowledge of weaving techniques, materials, design and historical and artistic links in order to better preserve these assets [Alba et al 2019]. ADASilk enables them to compare the fabrics they conserve with those in other collections, and thus obtain the information they need to launch better-documented restoration campaigns. It can be very difficult to distinguish the origin and technique of certain fabrics without applying a technical analysis which, in some cases, leads to the destruction of part of the textile [Alba et al 2020]. Comparing fabrics held in other collections - and already catalogued - is therefore particularly useful when it comes to analysing and identifying old and rare fabrics without having to damage them.
In addition to integrating heterogeneous data into the knowlegde graph, the SilkNow ontology provides a structure for automatic image recognition, facilitating the discovery of similarities between several textiles preserved in various European institutions. Image recognition performed on images from Garín's collections and images from public museums [Dorozynski, Clermont, and Rottensteiner 2019] [Clermont et al 2020] enabled [Alba et al 2020] to highlight the great similarities between an 18th-century dalmatic preserved by the Centro de Documentación y Museo Textil (CDMT) in Terrassa (Spain) and Garín's 19th-century Francia model. A comparison of these two objects through their motifs reveals that Garín drew inspiration from Lyonnais motifs for its religious textiles. These images help us to understand the link between Valencian and French motifs. On the other hand, during the 19th century, silk industries turned to revivals, reembracing ancient motifs that may have appeared obsolete and reinterpreting them to suit the tastes of each period. This complicates the process of dating and attribution. Sometimes, textiles found in geographically distant locations exhibit strikingly similar features; and the same fabric design can be observed in both secular and ecclesiastical garments. These confusing scenarios prompt researchers to question whether these designs are original or, alternatively, imitations of other textiles.

Integrating annotations into the data model

Prov DM: Provenance Data Model

SILKNOW has developed text [Massri and Mladenić 2020] [Schleider and Troncy 2021] and image [Dorozynski, Clermont, and Rottensteiner 2019] [Clermont et al 2020] analysis methods that infer new properties about objects from the metadata describing them, so as to enrich the existing metadata. If we examine the record shown in Figure 5, we find the following information in the “Technique” field: Damask 71% and Embroidery 52% and in the “Depiction” field: Floral motif 51% and Vegetal motif 57% (see Figure 6).
an example of  image analysis
Figure 6. 
Fabric predictions and associated confidence scores: example
This new information was obtained using convolutional neural networks trained on the data (texts and images) contained in the knowledge graph. Based on textual descriptions of silk objects or on images representing them, text [Rei 2021] or image [Dorozynski and Rottensteiner 2021] classification models are able to predict the values of certain properties: the period of production, the place of production, the technique and the material for the text classification model; the previous properties, plus the patterns represented for the image classification model. Each of these predictions is associated with a confidence score expressed as a percentage. In the previous example, the text classification model predicted the damask value with a confidence score of 71% for the “Technique” property; the image classification model predicted the floral motif value with a confidence score of 51% for the “Depiction” property.
It was crucial to incorporate this new information into the data model, so that it could be integrated into the knowledge graph. We also aimed to make a clear distinction between these predictions and the original data. We also want to provide ADASilk users with information on the degree of reliability of the results of these analyses. To model these annotations, we have chosen to use the conceptual data model Provenance Data Model (Prov DM), recommended by the W3C [Belhajjame et al 2013]. Figure 7 shows the entities and relationships from Prov DM that we used in graph form[23] .
A graphical representation of RDF triplets reflecting the inclusion of image analysis information using
              entities and relationships from the Provenance Data Model
Figure 7. 
Integration of image analysis data with ProvDM
Image or text analysis is represented as a Prov:Activity which can be qualified by a type (image analysis or text analysis). Depending on the case, this Prov:Activity takes an E38_Image (image analysis) or a text or E62_String (text analysis) as input and produces two declarations as output. In this model, the date of the analysis can be specified ; and if necessary, the analysis module can be specified with a prov:Agent class (of type Software Agent).

The SilkNow extension

During our examination of the metadata, we noticed that the technical and historical analyses written in free text were particularly rich. These sometimes very long blocks of text provide information about the weaving process, such as which weave(s)[24] , or which type(s) of yarns were used. However, this semantic richness is difficult to capture in the CIDOC CRM[25]. To integrate these descriptions into our data model, we use the P3_has note property, which is “a container for all informal descriptions about an object that have not been expressed in terms of CRM constructs” [Le Boeuf et al 2025]. It is used to model the way in which an object is characterized (e.g. its appearance, structure, etc.), but it does not allow to express “in a structured form, everything that can be said about an item”. This is one of the consequences of the CIDOC CRM formalism which does not have the function of capturing everything that can be said about an object.
These blocks of free text provide particularly precise information on the techniques used to produce the textiles described; often, they also provide detailed iconographic descriptions. The textual analysis carried out on these texts has thus made it possible to extract technical information (weaves and weaving techniques) and other information relating to iconographic description (motifs and styles), as illustrated in Figure 8. These different types of information may be of interest to the general public looking for particular patterns, textile specialists working on the history of techniques, or designers and creative industries wishing to draw inspiration from a historical style. Access to this information via a search engine would also offer a richer user experience by enabling the user to perform more refined queries on the data.
When the data is integrated into the knowledge graph, some information (techniques and patterns) is annotated using the SilkNow thesaurus (for more information on this thesaurus, see [Léon et al 2019]). This semantic enrichment is achieved in the following way: if the value of a character string for a technique or pattern can be associated with one of the character strings proposed by the thesaurus, this value is replaced by a unique identifier. Modeling these concepts by means of a formal language would make it possible to integrate these annotations into the knowledge graph, to give access to this new data via the exploratory search engine, and to create links between these concepts and the thesaurus more easily.
A diagram showing semantic enrichment through thematic annotation of object descriptions extracted from
              various museum catalogues, identifying phrases that identify weave, motif, style, and weaving
              technique
Figure 8. 
Types of information extracted from the descriptions: examples from the catalogues of the Palais Galliera, the Victoria & Albert Museum and the Mobilier National
The inherent flexibility of CIDOC CRM allows for the creation of new classes and properties to express new kinds of information. The aim is not to capture everything that can be said about an object, but to express some of the things that have been said about it - in this case, information about the process that led to the production of a silk fabric. Based on the results of the text analysis and the resulting data annotation, we created a CIDOC CRM-compatible model to formally express the process of creating and producing a silk fabric. This specialized extension, accessible via OntoME, offers 23 classes and 12 properties [Puren and Vernus 2021a]. To create this extension, we adopted a bottom-up approach, relying primarily on the examination of collected data. We also worked with domain experts to verify the validity of these new classes and properties.
Among these 23 classes and properties is class T1_Weaving, which is a subclass of E12_Production. T1_Weaving refers to the activity of interlacing the warp threads (the threads along the length of the fabric, which are stretched on the loom) and the weft threads (the threads that run alternately between the warp threads). This activity is usually carried out using a loom[26]. The product of the weaving is a fabric, represented by the class T7_Fabric, itself a subclass of E22_Man-Made Object. The production of this fabric is carried out following a technical procedure(s) (T25_Weaving Technique). The weaving process always includes the use of at least one specific weave (T21_Weave). It then produces a fabric that is often decorated with patterns (T18_Motif), sometimes characteristic of a style (T11_Style) identified by experts in the field (T13_Style Assignment). In Figure 9, we express this information with these new classes and associated properties, for a set of tapestries for the Grand salon of the apartment of the Empress[27] in Versailles[28] kept in the collection of the French Mobilier national (see the last description on Figure 8).
A graphical representation of RDF triplets showing details of the T1 Weaving class and its relationship
              to other T classes
Figure 9. 
CIDOC CRM and SilkNow extension compliant graph
As mentioned above, these new classes and properties facilitate the integration of semantic enrichments into the knowledge graph and allow the end user to query these annotations, but also to have access to additional contextualization elements. As shown in Figure 10, the T32_Weave Type class, for example, is used to express the different weaves used during the weaving process and defined in the SilkNow thesaurus.
A graphical representation of RDF triplets showing details of values in the T32 Weave Type class,
              including the accommodation of values in multiple languages
Figure 10. 
Using the T32_Weave Type class
We have done the same for the information on patterns by creating the class T34_Pattern Type. This class should also make it possible to create links with the definitions available in the thesaurus. For example, the user could consult the definition of the Palme motif[29] , represented on the tapestries held in the collections of the Mobilier national.

Conclusion

Silk heritage is of crucial importance for understanding European history and culture. But some conservation institutions lack the means to promote it and make it accessible to the public. More generally, silk heritage suffers from a lack of visibility, especially online, which has become the preferred tool for searching and discovering new information. Semantic Web technologies, however, offer the possibility of disseminating cultural heritage data more easily on the Web by improving its discoverability, but also of enriching this data by linking it to other information.
The SilkNow project wished to exploit these potentialities to enhance the visibility of the European silk heritage, by developing the exploratory search engine ADASilk dedicated to these specialized collections. ADASilk is based on a knowledge graph, whose creation is based on the SilkNow ontology - a subset of the CIDOC CRM and CRMsci ontology. The richness of the metadata contained in the knowledge graph has also led us to develop the SilkNow extension, a conceptual model for the description of ancient silk textiles. We believe that this specialized extension represents a first step in the development of a domain ontology for textile cultural heritage. Data reusers indeed play a vital role in data modeling. They can suggest collaborative and shared models to ensure this data is interoperable and completely reusable in the context of the Semantic Web. We have thus conceived the SilkNow extension as a first step towards the creation and dissemination of a new compatible model compatible with the CIDOC CRM SIG. To achieve such recognition, it is essential to extend the SilkNow framework to create an ontology that encompasses not only the results of textile manufacture, but also the places and methods of production, the circumstances surrounding this production and the memories associated with it. In addition, we have placed special emphasis in this article on the methodology we have used to create this specialized extension, so as to enable other research projects and institutions to adopt the same approach. We believe that such an approach could be systematized when the aim is to promote and protect an endangered heritage using the Web.
By creating the ADASilk exploratory search engine, SilkNow has demonstrated the value of using Semantic Web technologies to aggregate heterogeneous heritage data, provide access to it and increase its visibility. It follows in the footsteps of other projects that also aim to promote fragile and endangered heritage and demonstrates the validity of this type of approach for large-scale processing of cultural heritage data. SilkNow also illustrates the important work that needs to be done on heritage data if the objectives of the Semantic Web are to be achieved. But to bring this kind of project to fruition, a strong institutional commitment is required. Not only from the point of view of producing open, high-quality data, which is the essential fuel for this type of project, but also from an organizational and technical point of view. Such a commitment is not accessible to small and medium-sized conservation institutions [Sebastián-Lozano et al 2023]. Indeed, such projects require significant investments, and therefore remain inaccessible to small institutions, which would be the first to benefit from them. Having demonstrated the value of these approaches, it is therefore necessary to consider the development of tools to access these technologies, and thus make them accessible to the greatest number of people.
The integration of data into the SilkNow Knowledge Graph could be considerably facilitated by the use of automatic classification tools, based on the exploitation of Linked Open Data. Case studies have demonstrated the value of developing such methods for creating links (or properties) between entities [Bianchini and Bargioni 2021]. But they have also shown the need for semantic data to be available from a high-quality standard and as complete as possible [Freire, Manguinhas, and Isaac 2020] [Bianchini and Bargioni 2021]; without this, crucial aspects of the data may be left out, and links between data may thus be missing. We must be aware, however, that the diversity of cataloguing practices - and sometimes errors in metadata - as well as the inherent complexity of historical data, often unstructured or a little structured [Meroño-Peñuela et al 2015], can be a brake on this kind of approach, both from the point of view of automatic classification and the development of a knowledge graph based on CIDOC CRM.

Notes

[1] Silk Heritage in the Knowledge Society: from punched card to Big Data, Deep Learning and visual/tangible simulations: https://silknow.eu/.
[2] A full list of these institutions can be found here: https://ada.silknow.org/fr/museums.
[3] Datasets have been made available to the project by Garin1820, the Sicily cultural heritage and the Musée d'Art et d'Industrie de Saint-Etienne.
[4] For a definition of damask, please consult the SilkNow thesaurus: https://skosmos.silknow.org/thesaurus/en/page/168.
[10] IFLA Study Group on the Functional Requirements for Bibliographic Records, Functional Requirements for Bibliographic Records: Final Report, IFLA, 2009. URL: https://repository.ifla.org/handle/123456789/811.
[11] Martin Doerr, Francesca Murano and Achille Felicetti, Definition of the CRMtex. An Extension of CIDOC CRM to Model Ancient Textual Entities, Version 1.0, currently maintained by Francesca Murano and Achille Felicetti, June 2020. URL: https://cidoc-crm.org/crmtex/ModelVersion/version-1.0-0.
[12] For the full list: https://cidoc-crm.org/collaborations.
[13] A list of these classes and properties can be found on the OntoMe website: https://ontome.net/namespace/1
[14] The functional overviews are available online: http://www.cidoc-crm.org/functional-units.
[15] The latest version 1.5 was released in January 2022.
[16] For example, the record describing a tapestry (lé de tenture) preserved in the collections of the Château de Versailles includes a “History” field (Historique) and a “Comment” field (Commentaire). Using the CIDOC CRM and CRMsci models, we have expressed this information with the following triplets: S4_Observation O8_observed E22_Man-Made Object; S4_Observation P2_has type E55_Type (Historique / Commentaire); S4_Observation P3_has note E62_String.
[17] That is, with the highest number of descriptive fields filled in the records.
[18] On Figure 4, we visualize the triplets modeling the information on the production conditions of the tapestry conserved at the Château de Versailles, namely the production date ( E12_Production P4_has time-span E52_Time-Span), the producer/creator (E12_Production P14_carried out by E39_Actor), the materials used (E12_Production P126_employed E57_Material), and the techniques used (E12_Production P32_use general technique E55_Type)s.
[22] Religious vestments have often been conceived as ensembles: for example, a chasuble is accompanied by a stole.
[23] A description of the panel preserved in the Musée des Arts Décoratifs can be found at the following address: https://ada.silknow.org/fr/object/99528c86-aac9-3231-a9d9-b84f6e4756fd.
[24] The term weave refers to the way in which the weft and warp threads are interwoven. This is called satin weave, for example. For more information on this term, please consult the SilkNow thesaurus: https://skosmos.silknow.org/thesaurus/en/page/318.
[25] The SilkNow data model proposes to use the two triplets S4_Observation O8_observed S4_Observation and S4_Observation P3_has note E62_String to model the data provided by the metadata fields corresponding to these analyses (cf. Figure 3).
[26] The weaving entry can be found in the SilkNow thesaurus: https://skosmos.silknow.org/thesaurus/en/page/526.
[27] Marie-Louise I, Duchess of Parma, and second wife of Napoleon I.
[28] Tenture, portières, cantonnières destinées au Grand salon de l'appartement de l'Impératrice à Versailles (1813, Bissardon, Cousin & Bony), Collections du Mobilier national, URL: https://collection.mobiliernational.culture.gouv.fr/objet/GMMP-877-001.
[29] The palm concept is defined in the thesaurus: https://skosmos.silknow.org/thesaurus/en/page/763.

Works Cited

Achichi et al 2018 Achichi, M., Lisena, P., Todorov, K., Troncy, R., and Delahousse, J. (2018) “DOREMUS: A Graph of Linked Musical Works”. ISWC: International Semantic Web Conference, (pp. 279-294). Monterey, CA, United States.
Alba et al 2019 Alba, E., Fernández, M., Gaitán, M., León, A., Portalés, C., Sebastián-Lozano, J., and Sevilla, J. (2019) “Technological tools for the conservation of silk heritage: Improving the conservation of European Religious Textile Cultural Heritage”. ESRARC 2019. 11th European Symposium on Religious Art, Restoration and Conservation (pp. 150-156). Kermes.
Alba et al 2020 Alba, E., Gaitán, M., León, A., and Sebastián, J. (2020) “Spanish Religious Textiles from the 18th and 19th Centuries”. The Garín Case. European Journal of Science and Theology, 16(1), 187-197.
Belhajjame et al 2013 Belhajjame, K., B'Far, R., Cheney, J., Coppens, S., Cresswell, S., Gil, Y., . . . Tilmes, C. (2013) “PROV-DM: The PROV Data Model”. W3C Recommendation 30 April 2013. (L. Moreau, and P. Missier, Éds.) Available at: https://www.w3.org/TR/prov-dm/
Beretta 2021 Beretta, F. (2021) “A challenge for historical research: making data FAIR using a collaborative ontology management environment (OntoME)”. Semantic Web - Interoperability, Usability, Applicability, 2(2), 279-294.
Bermès 2013 Bermès, E. (2013) “Enabling your catalogue for the Semantic Web”. Dans S. Chambers (Éd.), Catalogue 2.0 : the future of the library catalogue (pp. 117-142). Chicago: Neal-Schuman, an imprint of the American Library Association.
Berners-Lee, Hendler, and Lassila 2001 Berners-Lee, T., Hendler, J., and Lassila O. (2001) “The Semantic Web”. Scientific American, 5(284), 28-37.
Bianchini and Bargioni 2021 Bianchini, C. and Bargioni, S. (2021) “Automated Classification Using Linked Open”. Cataloging and, 59(8), 835-852. doi:10.1080/01639374.2021.1977447
Bizer, Heath, and Berners-Lee 2009 Bizer, C., Heath, T., and Berners-Lee T. (2009) “Linked Data: the Story so far”. International Journal on Semantic Web and Information Systems, 5, 1-22.
Christen 2012 Christen, P. (2012) Data Matching. Berlin, Heidelberg: Springer.
Claerr and Westeel 2010 Claerr, T. and Westeel, I. (2010) Numériser et mettre en ligne. Villeurbanne: Presses de l’enssib.
Clermont et al 2020 Clermont, D., Dorozynski, M., Wittich, D., and Rottensteiner, F. (2020) “Assessing the semantic similarity of images of silk fabrics using convolutional neural networks”. SPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, (p. 641_648).
Doerr 2003 Doerr, M. (2003) “The CIDOC Conceptual Reference Model : An ontological approach to semantic interopérability of metadata”. AI Magazine(24), 75-92.
Dorozynski and Rottensteiner 2021 Dorozynski, M. and Rottensteiner, F. (2021) “SILKNOW Image Classification - final version WP6 - Multi-Task Classification (1.0)”. doi:https://doi.org/10.5281/zenodo.5091813
Dorozynski, Clermont, and Rottensteiner 2019 Dorozynski, M., Clermont, D., and Rottensteiner F. (2019) “Muti-task deep learning with incomplete training samples for the image-based prediction of variables describing silk fabrics”. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, IV(2/W6), 47-54.
Eastermann 2015 Eastermann, B. (2015) “Open data et crowdsourcing : un état des lieux du point de vue des musées”. La Lettre de l’OCIM(162), 41-46.
Foron-Dauphin and Cano 2016 Foron-Dauphin, N. and Cano, D. (2016) “Le patrimoine des Soieries Bonnet (Jujurieux, Ain) : conserver et valoriser in situ un ensemble unique de l’industrie textile”. In Situ Revues des patrimoines(29).
Fournier et al 2016 Fournier, M., Despierres, C., Mercandelli, C., and Bernard, E. (2016) “Conservation et présentation des lieux et collections de patrimoine textile : l’exemple de la fédération « Transmission du savoir-faire textile en Rhône-Alpes »”. {In Situ Revues des patrimoines(29).
Freire et al 2018 Freire, N., Meijers, E., Voorburg, R., and Isaac, A. (2018) “Aggregation of cultural heritage datasets through the Web of Data”. Procedia Computer Science, 120-126.
Freire et al 2019 Freire, N., Voorburg, R., Cornelissen, R., de Valk, S., Meijers, E., and Isaac, A. (2019) “Aggregation of Linked Data in the Cultural Heritage Domain: A Case Study in the Europeana Network”. Information, 10(8).
Freire, Manguinhas, and Isaac 2020 Freire, N., Manguinhas, H., and Isaac A. (2020) “An Observational Study of Equivalence Links in Cultural Heritage Linked Data for agents”. Digital Libraries for Open Knowledge : 24th International Conference on Theory and Practice of Digital Libraries, TPDL 2020, Lyon, France, August 25–27, 2020, Proceedings (pp. 62-70). Cham: Springer International Publishing.
Gaitán et al 2019 Gaitán, M., Alba, E., León, A., Pérez, M., Sevilla, J., and Portalés, C. (2019) “Towards the Preservation and Dissemination of Historical Silk Weaving Techniques in the Digital Era”. Heritage, 2(3), 1892-1911.
Grant, Nieuwenhuis, and Petersen 1995 Grant, A., Nieuwenhuis, J., and Petersen T. (1995) International guidelines for museum object information : the CIDOC information categories. s.l.: ICOM International Committee for Documentation.
Gruber 1993 Gruber, T. (1993) A Translation Approach to Portable Ontology Specifications. Stanford: Knowledge Systems Laboratory, Stanford University.
Hyvönen et al 2021 Hyvönen, E., Ikkala, E., Koho, M., Tuominen, J., Burrows, T., and Ransom, L. (2021) “Mapping Manuscript Migrations on the Semantic Web: A Semantic Portal and Linked Open Data Service for Premodern Manuscript Research”. ISWC 2021: The Semantic Web – ISWC 2021 (pp. 615-630). Cham: Springer.
Kergosien et al 2019 Kergosien, Eric, Daloz, A., Chaudiron, S., and Jacquemin, B. (2019) “Le CIDOC CRM, un modèle ontologique pour représenter les connaissances du bassin minier des Hauts de France”. Premiers résultats. 6th Hyperheritage International Symposium (HIS6) ”La fabrique du patrimoine à l’ère numérique". Paris.
Koch, Ribeiro, and Lopes 2020 Koch, I., Ribeiro, C., and Lopes C. (2020) “ArchOnto, a CIDOC-CRMBased Linked Data Model for the Portuguese Archives”. Dans M. Hall, T. Merčun, T. Risse, and F. Duchateau (Éds.), Digital Libraries for Open Knowledge. TPDL 2020. Lecture Notes in Computer Science (pp. 133-146). Cham: Springer.
Kritsotaki et al 2017 Kritsotaki, A., Hiebel, G., Theodoridou, M., Doerr, M., and Rousakis, Y. (2017) Definition of the CRMsci. An Extension of CIDOC-CRM to support scientific observation. Version 1.2.3.
Le Boeuf et al 2025 Le Boeuf, P., Doerr, M., Emil Ore, C., and Stead, S. (2015) Definition of the CIDOC Conceptual Reference Model. Version 6.2. Available at: https://cidoc-crm.org/
Léon et al 2019 Léon, A., Gaitán, M., Sebastian Lozano, J., Alba Pagán, E., and Insa, I. (2019) “SILKNOW”. Designing a thesaurus about historical silk for small and medium-sized textile museums. Science and Digital Technology for Cultural Heritage: Interdisciplinary Approach to Diagnosis, Vulnerability, Risk Assessment and Graphic Information Models. Proceedings of the 4th International Congress Science and Technology for the Conservation of Cult (pp. 187-190). London: CRC Press.
Marden et al 2013 Marden, J., Li-Madeo, C., Whysel, N., and Edelstein, J. (2013) “Linked open data for cultural heritage”. Proceedings of the 31st ACM International Conference on Design of Communication - SIGDOC ’13. the 31st ACM international conference (pp. 107–112). New York, NY, USA: Association for Computing Machinery.
Massri and Mladenić 2020 Massri, B. and Mladenić, D. (2020) “Extracting structured metadata from multilingual textual descriptions in the domain of silk heritage”. SiKDD 2020. Conference on Data Mining and Data Warehouses. Ljubjana.
Meroño-Peñuela et al 2015 Meroño-Peñuela, A., Ashkpour, A., van Erp, M., Mandemakers, K., Breure, L., Scharnhorst, A., . . . van Harmelen, F. (2015) “Semantic Technologies for Historical Research : a Survey”. Semantic Web, 6(6), 539-564.
Messaoudi et al 2019 Messaoudi, T., De Luca, L., Véron, P., and Halin, G. (2019) “Vers une ontologie de domaine pour l’analyse de l'état de conservation du bâti patrimonial”. In Situ. Revue des patrimoines(39).
Pagán et al 2020 Pagán, E. A., Gaitán Salvatella, M., Pitarch, M. D., León Muñoz, A., Moya Toledo, M., Ruiz, J. M., . . . Puren, M. (2020) “From Silk to Digital Technologies: A Gateway to New Opportunities for Creative Industries, Traditional Crafts and Designers”. The SILKNOW Case. Sustainability, 12(19).
Peroni, Tomasi, and Vitali 2013 Peroni, S., Tomasi, F., and Vitali F. (2013) “The aggregation of heterogeneous metadata in Web-based cultural heritage collections”. A case study. International Journal of Web Engineering and Technology, 8(4), 412-432.
Petrov 2019 Petrov, J. (2019) Fashion, History, Museums. Inventing the display of dress. London: Bloomsbury Academic.
Portalés et al 2018 Portalés, C., Sebastián, J., Alba, E., Sevilla, J., Gaitán, M., Ruiz, P., and Fernández, M. (2018) “Interactive Tools for the Preservation, Dissemination and Study of Silk Heritage — An Introduction to the SILKNOW Project”. Multimodal Technologies and Interaction, 2(2).
Puren and Vernus 2021a Puren, M. and Vernus, P. (2021) SIKNOW 0.1. Consulté le décembre 29, 2021, sur https://ontome.net/namespace/36
Puren and Vernus 2021b Puren, M. and Vernus, P. (2021) “Silknow Generic Profile”. Available at: http://ontome.dataforhistory.org/profile/7
Rei 2021 Rei, L. (2021) “SILKNOW Text Classifier Models”. Available at: https://doi.org/10.5281/zenodo.5070696
Schleider and Troncy 2021 Schleider, T. and Troncy, R. (2021) “Zero-Shot Information Extraction to Enhance a Knowledge Graph Describing Silk Textiles”. Proceedings of the 5th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature. Punta Cana, Dominican Republic (online): Association for Computational Linguistics.
Schleider et al 2021b Schleider, T., Troncy, R., Ehrhart, T., Dorozynski, M., Rottensteiner, F., Sebastian, L., and Lo Cicero, G. (2021) “Searching Silk Fabrics by Images Leveraging on Knowledge Graph and Domain Expert Rules”. Proceedings of the 3rd Workshop on Structuring and Understanding of Multimedia Heritage Contents (pp. 41–49). New York, NY, USA: Association for Computing Machinery.
Schleider et al 2021c Schleider, T., Troncy, R., Gaitan, M., Sebastian Lozano, J., Mladenic, D., Kastelic, A., . . . Lo Cicero , G. (2021) “The SILKNOW Knowledge Graph”. Preprint.
Sebastián-Lozano et al 2023 Sebastián-Lozano, J., Pagán, E., Roig, E. M., Gaitán, M., León, A., Sevilla Peris, J., . . . Mladenic, D. (2023) “Open Access to Data about Silk Heritage: A Case Study in Digital Information Sustainability”. Sustainability, 15(19), 14340.
Stewart and Marcketti 2012 Stewart, T. and Marcketti, S. (2012) “Textiles, dress, and fashion museum website development: strategies and practices”. Museum Management and Curatorship, 27(5), 523-538.
Suls 2017 Suls, D. (2017) “Europeana Fashion: Past, present and future”. Libraries Journal, 42(3), 123–129.
Wang, Xia, and Niu 2014 Wang, Z., Xia, S., and Niu Q. (2014) “A novel ontology analysis tool, Applied Mathematics and Information Sciences”. Applied Mathematics and Information Sciences, 8(1), 255-261.
Preview  |  XMLPDFPrint