moosetechnology
diff --git a/‎_drafts/2025-03-05-a-real-example-on-using-tags.md‎
Lines changed: 63 additions & 48 deletions b/‎_drafts/2025-03-05-a-real-example-on-using-tags.md‎
Lines changed: 63 additions & 48 deletions
@@ -4,7 +4,7 @@ layout: post
 title: A real example on using tags
 header-img: img/posts/DSM.jpg
 subtitle: >-
-  In this post I show how one can use tags to perform a real, concrete, analysis. 
+  In this post I show step by step how one can use tags to perform a real, concrete, analysis. 
 date: 2025-03-05
 background: '/img/posts/bg-posts.jpg'
 author: Nicolas Anquetil
@@ -17,7 +17,7 @@ Tags can be a powerful tool to visualize things on legacy software and perform a
 For example, tags can be used to create virtual entities and see how they "interact" with the real entities of the system analyzed.
 In the article [Decomposing God Classes at Siemens](https://rmod-files.lille.inria.fr/Team/Texts/Papers/Anq19a-ICSME-GodClass.pdf) we show how tags can be used to create virtual classes and see their dependencies to real classes.
 
-In this post I will show another use of tags:  how they can materialize a concept and show its instanciation in a system.
+In this post I will show another use of tags:  how they can materialize a concept and show its instantiation in a system.
 
 The scenario is that of analysing Corese, a platform to "create, manipulate, parse, serialize, query, reason and validate RDF data."
 Corese is an old software that dates back to the early days of Java.
@@ -40,18 +40,18 @@ Let us see how Moose can help in the task.
 
 ### Where are the constants used?
 
-First, we need the source code of Corese-core ([https://github.com/corese-stack/corese-core](https://github.com/corese-stack/corese-core)).
-As usual we will create a model of this code:
+For an analysis in Moose, we need a model of the system, and this starts with getting the source code ([https://github.com/corese-stack/corese-core](https://github.com/corese-stack/corese-core)).
+The model is created using VerveineJ which can be run using docker:
 ```
 docker run -rm -v src/main/java/:/src ghcr.io/evref-bl/verveinej:latest -alllocals -o corese-core.json
 ```
 This will create a file `corese-core.json` in the directory `src/main/java/`.
-In the creation of the model we gave to VerveineJ the option `-alllocals`.
-This is because VerveineJ by default only tracks uses of variables with a non primitive type (variables containing objects).
-But here the constants are integers and if we want to know where they are used, we need more details.
+The command to create the model as an option `-alllocals`.
+This is because VerveineJ by default only tracks the uses of variables with non primitive type (variables containing objects).
+Here the constants are integers and if we want to know where they are used, we need more details.
 
 Let's import the model in Moose.
-This can be done simply by draging-and-droping the file in Moose.
+This can be done simply by dragging-and-dropping the file in Moose.
 
 !["Importing the Corese model"](/img/posts/2025-03-05-using-tags/corese-import.png)
 
@@ -68,23 +68,25 @@ public interface IStats {
 [...]
 ```
 
-First we need to find these constants to see where they are used.
-For this, we can inspect the model and look for all "Model Attributes".
-The constants are actually *attributes* of the interface/class in which they are defined.
+To find where the constants are used, we need to find the representation of the constants in the model.
+For this, we can inspect the model ("Inspect" button in the Model Browser) and look for all "Model Attributes".
+The constants are *attributes* of the interface/class in which they are defined as shown in the listing above).
 And they are *model attributes* because they are defined in the source code analysed, as opposed to `System.out` which may be used in the code but for which we don't have the source code.
 
 We can then select all the model attributes named PREDICATE:
 `select: [ :each | each name = 'PREDICATE']`.
-Moose gives us 8 different definitions of PREDICATE (and 9 of OBJECT, and 10 of SUBJECT).
-The one we are interested in is the 3rd in the list (IStats.PREDICATE).
+*(note, the backslash (\\) before the square bracket ([) was added by the publishing tool and is not part of the code)*
+
+Moose gives us 8 different definitions of PREDICATE (and 9 for OBJECT, and 10 for SUBJECT).
+The one we are interested in is the 3rd in the list (`IStats.PREDICATE`).
 
 !["All attributes named PREDICATE"](/img/posts/2025-03-05-using-tags/predicate-definitions.png)
 
-Having the same constants defined multiple times is not good news for the analysis and for the developpers.
+Having the same constants defined multiple times is not good news for the analysis and for the developers.
 But this kind of thing is fairly common in old systems which evolved during a long time in the hands of many developers.
 Not all of them had a complete understanding of the system and each had different skills and programming habits.
 
-Looking closer to the lists of definitions for the 3 main constants (SUBJECT, PREDICATE, OBJECT), we find that there are at least 5 different definitions of the same constants:
+Looking at the lists of definitions for the 3 main constants (SUBJECT, PREDICATE, OBJECT), we find that there are at least 5 different definitions of these constants:
 
 - stats.IStats:
 ```java
@@ -114,8 +116,8 @@ Looking closer to the lists of definitions for the 3 main constants (SUBJECT, PR
  ```java
 
         public static int TRIPLE 	 = 88;
-        public static int SUBJECT 	 = 89;
-        public static int PREDICATE 	 = 90;
+        public static int SUBJECT  = 89;
+        public static int PREDICATE = 90;
         public static int OBJECT 	 = 91;
 ```
 - kgram.core.Exp
@@ -133,78 +135,91 @@ Note: Don't close the Inspector window yet, we are going to need it soon.
 ### Tagging the constants and their uses
 
 Moose can help us here with tags.
-Tags are (as the name imply) just labels that we can attach to any entity in the model.
-Additionnaly, tags have a color that will help us distinguish them in visualizations.
+Tags are (as the name implies) just labels that can be attached to any entity in the model.
+Additionally, tags have a color that will help us distinguish them in visualizations.
 
 So let's tag our constants.
-We will define 5 tags, one for each "set of constant", that is to say one for each of the 5 classes that implement these constants.
-You can choose whatever name and whatever color you prefer for your tags, as long as you remember which is which.
+We will define 5 tags, one for each set of constants, that is to say one for each of the 5 classes that implement these constants.
+You can choose whatever name and color you prefer for your tags, as long as you remember which is which.
+Here I named the tags from the name of the classes that define each set of constant.
 
 !["The tags that represent each set of constant"](/img/posts/2025-03-05-using-tags/tags.png)
 
 Now we want to tag all the constants in a set with the same tag.
-Let's start with the IStats set, the one listed in the previous section and that was our initial focus.
+Let's see how to do it for constants in `IStats`, the ones listed in the previous section and that were our initial focus.
 
 We select the "IStats" tag in the Tag Browser and go back to the Inspector where we have a list of all definitions of PREDICATE.
 If we click on the 3rd of these PREDICATE ("fr::inria::corese::core::stats::IStats.PREDICATE"), a new pane appears on the right, focusing on this attribute.
 There, we can click on its "parentType", giving yet another pane.
+(The following screenshot shows the inspector right before we click on "parentType").
 
-!["The tags that represent each set of constant"](/img/posts/2025-03-05-using-tags/istats-predicate-attribute.png).
+!["The inspector while navigating to the set of attributes of IStats"](/img/posts/2025-03-05-using-tags/istats-predicate-attribute.png).
 
-The right pane now focuses on the IStats interface.
+The right pane now focuses on the `IStats` Java interface.
 We can click on "attributes" to get the list of attributes it defines (including PREDICATE from which we started).
-There are 5 attributes which are the one listed at the top of the blogpost.
+There are 5 attributes which are the ones listed in the previous section.
 
 So far so good.
 
-If we "propagate" this list of 5 attributes (toolbar button of the inspector on the right), the list will be propagated (!) to all tools that are in "Follow" mode.
+To tag these attributes, we will "propagate" them (toolbar button of the Inspector on the right) to all tools that are in "Follow" mode.
 Note that if you minimized the Tag Browser at some point, it will be in "Freeze" mode like in the screenshot above.
-You need to put it back in "Follow" (radio toolbar button on the left).
-Then the propagated list will appear in the center pane of the Tag Browser and you can pass it to the right pane of the Tag Browser with the "\>\>\>" button.
-You now have tagged these 5 constants with the "IStats" tag.
+You need to put it back in "Follow" (radio toolbar button on the left) before propagating the list of constants.
+
+Once propagated, the list appears in the center pane of the Tag Browser and you can pass it to the right pane with the "\>\>\>" button.
+Doing this will effectively tag the entities with the selected tag.
 
+We now have tagged these 5 constants with the "IStats" tag.
 Ideally we want to find also the usage of these constants.
+So we would like to also tag the methods that use these constants.
+
 For this you can open a Query Browser, it will start with the same list of 5 attributes that we just propagated.
-We can create a "Navigation query" and ask for all the "incoming" "accesse" to these attributes as shown below.
+We can create a "Navigation query" and ask for all the "incoming" "accesses" to these attributes as shown below.
 The result is a list of 6 methods.
 
 !["The methods accessing the 5 attributes propagated"](/img/posts/2025-03-05-using-tags/query-browser.png)
 
-We can also propagate these 6 methods and they will appear in the Tag Browser.
-We tag the methods accessing our 5 attributes with the same tag as the attributes themselves.
+We can now propagate these 6 methods and they will appear in the Tag Browser.
+We tag them with the same tag as the attributes themselves.
 
 You can repeat the same operations for the 5 sets of constants listed above and the 5 different tags.
 
 ### Visualizing the result
 
-The idea now is to visualize where each set of constant is used.
-For this, we will use an "Architectural Map" which is a fine tool to visualize tags.
+All this tagging was to be able to visualize where each set of constant is defined and, most importantly, used.
+We now turn to the "Architectural Map" which is a fine tool to visualize tags.
+for example, we could show all the top level packages of Corese and the Architectural Map will give visual clues on which ones contain tagged entities, and what tags.
+The Architectural Map allows to expand the content of entities which will allow us to deep dive into each package containing tagged entities to understand where exactly the entities is used or defined.
 
-First we want to select all the packages at the top level in Corese core.
-We go back one last time to the Inspector to the very first pane on the left (you may also "Inspect" again the model to open a new Inspector).
+To select all the top level packages, we go back one last time to the Inspector to the very first pane on the left (you may also "Inspect" again the model to open a new Inspector).
 We select the "Model packages" and enter this query in the "script" at the bottom: `self select: [ :each | each parentPackage isNotNil and: [each parentPackage name = 'core'] ]`.
-This gives us a list of 23 packages that we can propagate.
-Finally we open an Architectural Map on these 23 packages.
+*(Again, ignore the backslashes)*
 
-For easier visualization, I restricted the Architectural Map to the only 5 packages that do use the tags: "stats", "kgram", "util", "sparql", and "query".
-And I expanded "kgram" that is small and contains different tags.
+The result is a list of 23 packages that we can propagate.
+Finally we open an Architectural Map that will start with the 23 packages that we just propagated.
+
+In the following screenchot, I restricted the Architectural Map to the only 5 packages that do use our tags: "stats", "kgram", "util", "sparql", and "query".
+This makes it easier to see the results here.
+I also expanded "kgram" that is small and contains different tags.
 
 !["The packages using the 5 attributes"](/img/posts/2025-03-05-using-tags/architectural-map.png)
 
-The single color square, on the right of each package name, shows that it contains entities having one, sole tag of this color.
+The single-color square, on the right of each package name, shows that it contains entities having one uniq tag (of this color).
 In our case it means that it contains the constants and methods accessing them, all with the same tag.
+For example, "core" and "util" packages contain entities tagged with only the green tag (which corresponds to the `kgram.core.Exp` class as previously shown in the Tag Browser screenshot).
 
-When the square is multicolored, it means it contains entities with differrent tags.
+When the square is multicolored, it means it contains entities with different tags.
 For example, we see that the package "kgram" contains at least the green ("Exp") and the yellow ("Const") tags.
 
-Note that in this particular case, I added another tag for class `kgram.api.core.Node` which has its own definition of the OBJECT constant because I wanted to see where it was used also.
-This is the reason for the multicolor square of class "StatsBasedEstimation", in package "stats", which uses OBJECT form `Node` and the other constants from `IStats`.
-
+Note that in this particular case, I added another tag for class `kgram.api.core.Node` which has its own definition of the OBJECT constant.
+I wanted to see where it was used also.
+This is the reason for the multicolored square of class `StatsBasedEstimation`, in package "stats", which uses OBJECT from `Node` and the other constants from `IStats`.
 
 In the end, the visualization allows to conclude that each package sticks pretty much to its own definition of the constants which is rather reassuring.
 It also shows where one would have to look if we were to replace the constant by a real enum.
 
 This is not the end of it however because the constant values used in these methods can be passed off to other methods as argument.
-Here Famix alone (the dependency meta-model used in Moose by default) can no longer help us to follow the flow of usage of the constants because they are just integer being passed around.
-For a finer analysis, a real AST model should be used which is the purpose of the FAST meta-model (Famix-AST).
-But this is another story and falls outside of this blogpost
+Here Famix alone (the meta-model used in Moose by default) can no longer help us to follow the flow of usage of the constants because they are just integer being passed around.
+For a finer analysis, a complete AST model should be used.
+This could be done with the FAST meta-model (Famix-AST), but it is another story that falls outside the scope of this blog-post.
+
+See you latter.