Tag Analysis and Standardization Report

Vault: mi-quartz/content
Date: 2025-11-24
Total Files Analyzed: 160 markdown files
Total Unique Tags: 420
Total Tag Occurrences: 826


Executive Summary

The vault contains 420 unique tags across 160 files (826 tag occurrences, roughly 5 tags per file). The analysis reveals four areas for improvement:

  1. Inconsistent naming conventions (singular vs plural, accents)
  2. No clear hierarchical structure (flat tagging system)
  3. Document type tags missing (no systematic use of “clase”, “concepto”, “resumen”)
  4. Course identification tags underutilized (only 1 file tagged with MAT1610)

Current State Analysis

1. Tag Distribution by Course

Course                     Files      Top Tags
MAT1610 (Cálculo I)        75 files   calculo (38), concepto (40), derivadas (16), limites (13)
MAT1203 (Álgebra Lineal)   62 files   algebra-lineal (34), determinante (6), transformaciones-lineales (6)
EYP1016 (Estadística)      11 files   estadistica (6), eyp1016 (3)
Unknown                    12 files   Needs course identification

2. Tag Distribution by Document Type

Document Type   Files      Top Tags
Clase           72 files   algebra-lineal (30), calculo (27), clase (24)
Concepto        75 files   concepto (40), derivada (12), calculo (11)
Resumen         5 files    Various summary tags
Unknown         8 files    Needs categorization

3. Most Common Tags (Top 20)

40  concepto
38  calculo
34  algebra-lineal
24  clase
16  derivadas
13  limites
12  derivada
 6  estadistica
 6  determinante
 6  transformaciones-lineales
 6  optimizacion
 5  sistemas-lineales
 5  espacios-vectoriales
 5  bases
 5  concepto-fundamental
 5  aplicaciones-derivada
 5  fisica
 5  maximos-minimos
 5  teorema-valor-medio
 5  antiderivadas
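
A frequency list like the one above could be produced along the following lines. This is a minimal sketch, assuming tags are written as inline hashtags in the note body; `count_tags` and the `HASHTAG` pattern are illustrative names, not the script that generated this report.

```python
import re
from collections import Counter

# Assumption: a tag is '#' followed by non-space, non-'#' characters.
# Markdown headings ('# Title', '## Tags') do not match because '#' is
# followed there by a space or another '#'.
HASHTAG = re.compile(r"#([^\s#]+)")

def count_tags(notes):
    """Count tag occurrences across an iterable of note texts."""
    counts = Counter()
    for text in notes:
        counts.update(HASHTAG.findall(text))
    return counts

sample_notes = [
    "## Tags\n#calculo #limites",
    "## Tags\n#calculo #derivadas",
]
for tag, n in count_tags(sample_notes).most_common(3):
    print(f"{n:>3}  {tag}")
```

Tags stored elsewhere (for example in YAML frontmatter) would need a different extraction step.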

Identified Inconsistencies

1. Singular vs Plural Variants

Singular       Plural          Recommendation
determinante   determinantes   Use plural (determinantes)
base           bases           Use plural (bases)
limite         limites         Use plural (limites)
derivada       derivadas       Use plural (derivadas)
logaritmo      logaritmos      Use plural (logaritmos)

Rationale: Plurals are more common and general for topics that encompass multiple concepts.

2. Accent Inconsistencies

With Accent   Without Accent   Recommendation
cálculo       calculo          Use without accent (calculo)
límites       limites          Use without accent (limites)

Rationale: Tags should use ASCII characters for better compatibility and searchability.

3. Number-Specific Tags (Should be Removed)

clase-02, clase-03, clase-04, ... clase-33
clase-5, clase-6, clase-7, clase-8, clase-9, clase-10

Recommendation: Remove class number tags. Use folder structure for organization instead.
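
Both numbering styles listed above (zero-padded and unpadded) can be matched with a single pattern. The following is a small sketch; `drop_class_number_tags` is a hypothetical helper name, and the pattern assumes class tags always have the form `clase-` followed by digits.

```python
import re

# Matches clase-02 as well as clase-5; the bare tag 'clase' is not matched.
CLASS_NUMBER_TAG = re.compile(r"clase-\d+")

def drop_class_number_tags(tags):
    """Return the tags with all class-number tags filtered out."""
    return [t for t in tags if not CLASS_NUMBER_TAG.fullmatch(t)]

print(drop_class_number_tags(["clase", "clase-02", "clase-5", "limites"]))
```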


Proposed Tag Taxonomy

Hierarchical Structure

courses/
├── mat1610-calculo/
├── mat1203-algebra/
└── eyp1016-estadistica/

document-types/
├── clase
├── concepto
├── resumen
├── ejercicios
└── formulas

topics/
├── calculo/
│   ├── limites/
│   │   ├── limites-laterales
│   │   ├── limites-infinitos
│   │   └── limites-notables
│   ├── derivadas/
│   │   ├── regla-cadena
│   │   ├── regla-producto
│   │   ├── regla-cociente
│   │   └── derivacion-implicita
│   ├── integrales/
│   │   ├── antiderivadas
│   │   ├── integral-definida
│   │   └── tfc
│   └── aplicaciones/
│       ├── optimizacion
│       ├── razones-relacionadas
│       └── aproximacion-lineal
│
├── algebra/
│   ├── vectores/
│   │   ├── operaciones-vectoriales
│   │   ├── producto-punto
│   │   └── combinaciones-lineales
│   ├── matrices/
│   │   ├── operaciones-matrices
│   │   ├── matriz-inversa
│   │   └── determinantes
│   ├── sistemas-lineales/
│   │   ├── forma-escalonada
│   │   ├── solucion-sistemas
│   │   └── consistencia
│   ├── espacios-vectoriales/
│   │   ├── subespacios
│   │   ├── bases
│   │   ├── dimension
│   │   └── independencia-lineal
│   └── transformaciones-lineales/
│       ├── matriz-transformacion
│       ├── valores-propios
│       └── diagonalizacion
│
└── estadistica/
    ├── probabilidad/
    │   ├── espacios-probabilidad
    │   ├── probabilidad-condicional
    │   └── independencia
    ├── variables-aleatorias/
    │   ├── variables-discretas
    │   ├── variables-continuas
    │   └── distribuciones
    └── estadistica-inferencial/
        ├── muestras
        └── teorema-central-limite

crosscutting/
├── concepto-fundamental
├── teorema
├── formula
├── aplicacion-practica
└── demostracion
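
If the tree is later expressed as nested tags, the slash-separated paths can be generated from a simple data structure. The sketch below encodes only an excerpt of the topics branch; `TAXONOMY_EXCERPT` and `nested_tags` are illustrative names, and the three-level layout (area / group / leaf) is an assumption based on the tree above.

```python
# Excerpt of the proposed topics tree: area -> group -> leaf tags.
TAXONOMY_EXCERPT = {
    "calculo": {
        "limites": ["limites-laterales", "limites-infinitos", "limites-notables"],
        "integrales": ["antiderivadas", "integral-definida", "tfc"],
    },
    "algebra": {
        "matrices": ["operaciones-matrices", "matriz-inversa", "determinantes"],
    },
}

def nested_tags(taxonomy):
    """Flatten the tree into slash-separated tag paths."""
    paths = []
    for area, groups in taxonomy.items():
        for group, leaves in groups.items():
            paths.extend(f"{area}/{group}/{leaf}" for leaf in leaves)
    return paths

for path in nested_tags(TAXONOMY_EXCERPT):
    print(path)
```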

Standardized Tagging Scheme

Tag Format Rules

  1. Lowercase only: All tags in lowercase
  2. No accents: Use ASCII characters only (calculo, not cálculo)
  3. Hyphens for separation: Use hyphens for multi-word tags (regla-cadena)
  4. Plural form: Use plural for topic tags (limites, derivadas, matrices)
  5. No class numbers: Don’t use clase-01, clase-02, etc.
  6. Hierarchical notation: Use forward slashes for nested tags in the future (calculo/limites)
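
Rules 1 to 3 are mechanical and can be applied by a short function. The following is a minimal sketch using only the standard library; `normalize_tag` is a hypothetical name. It does not attempt rule 4 (pluralization), which needs an explicit mapping.

```python
import re
import unicodedata

def normalize_tag(raw: str) -> str:
    """Apply format rules 1-3: lowercase, no accents, hyphen-separated."""
    tag = raw.strip().lstrip("#")
    # Decompose accented characters (a-acute becomes 'a' plus a combining
    # accent), then drop the combining marks.
    decomposed = unicodedata.normalize("NFKD", tag)
    unaccented = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Collapse runs of whitespace or underscores into single hyphens.
    hyphenated = re.sub(r"[\s_]+", "-", unaccented.lower())
    return hyphenated.strip("-")

print(normalize_tag("#Cálculo"))       # calculo
print(normalize_tag("Regla Cadena"))   # regla-cadena
```

Forward slashes pass through unchanged, so a nested tag such as calculo/limites is preserved.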

Required Tags for Each File

Every file should carry at least 5 tags (one course tag, one document type tag, and 3-5 topic tags), in this order:

  1. Course tag (always first)

    • mat1610-calculo
    • mat1203-algebra
    • eyp1016-estadistica
  2. Document type tag (always second)

    • clase
    • concepto
    • resumen
    • ejercicios
    • formulas
  3. Topic tags (3-5 specific tags)

    • Main topic
    • Subtopics
    • Related concepts
  4. Optional crosscutting tags

    • concepto-fundamental
    • teorema
    • aplicacion-practica
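
The ordering and count requirements above can be checked automatically. This is a sketch under the assumption that a file's tags are available as an ordered list; `check_required_tags` is a hypothetical helper, and the crosscutting set contains only the three tags listed in this section.

```python
COURSE_TAGS = {"mat1610-calculo", "mat1203-algebra", "eyp1016-estadistica"}
DOC_TYPE_TAGS = {"clase", "concepto", "resumen", "ejercicios", "formulas"}
CROSSCUTTING_TAGS = {"concepto-fundamental", "teorema", "aplicacion-practica"}

def check_required_tags(tags):
    """Return a list of problems; an empty list means the tags comply."""
    problems = []
    if not tags or tags[0] not in COURSE_TAGS:
        problems.append("first tag must be a course tag")
    if len(tags) < 2 or tags[1] not in DOC_TYPE_TAGS:
        problems.append("second tag must be a document type tag")
    # Everything after position two that is not crosscutting counts as a topic tag.
    topic_tags = [t for t in tags[2:] if t not in CROSSCUTTING_TAGS]
    if not 3 <= len(topic_tags) <= 5:
        problems.append(f"expected 3-5 topic tags, found {len(topic_tags)}")
    return problems

print(check_required_tags(
    ["mat1203-algebra", "concepto", "vectores", "producto-punto", "ortogonalidad"]
))  # []
```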

Examples

Example 1: Clase file

## Tags
#mat1610-calculo #clase #limites #limites-laterales #funcion-heaviside #continuidad

Example 2: Concepto file

## Tags
#mat1203-algebra #concepto #vectores #producto-punto #ortogonalidad

Example 3: Resumen file

## Tags
#mat1610-calculo #resumen #derivadas #aplicaciones-derivada #optimizacion #teorema-valor-medio

Tag Consolidation Map

Tags to Merge/Standardize

Current Tags                                    Standardized Tag        Count
calculo, cálculo                                calculo                 41
limites, límites, limite                        limites                 18
derivada, derivadas                             derivadas               28
aplicaciones-derivada, aplicaciones-derivadas   aplicaciones-derivada   7
determinante, determinantes                     determinantes           7
base, bases                                     bases                   6
logaritmo, logaritmos                           logaritmos              3
limites-infinito, limites-infinitos             limites-infinitos       4
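
The map translates directly into a lookup table. The sketch below lists only the variants that differ from their standardized form; `CONSOLIDATION_MAP` and `consolidate` are illustrative names. Duplicates created by merging are dropped while the original order is kept.

```python
# Variant -> standardized tag, taken from the consolidation map above.
CONSOLIDATION_MAP = {
    "cálculo": "calculo",
    "límites": "limites",
    "limite": "limites",
    "derivada": "derivadas",
    "aplicaciones-derivadas": "aplicaciones-derivada",
    "determinante": "determinantes",
    "base": "bases",
    "logaritmo": "logaritmos",
    "limites-infinito": "limites-infinitos",
}

def consolidate(tags):
    """Replace variants with standardized tags, removing resulting duplicates."""
    seen = set()
    result = []
    for tag in tags:
        standard = CONSOLIDATION_MAP.get(tag, tag)
        if standard not in seen:
            seen.add(standard)
            result.append(standard)
    return result

print(consolidate(["cálculo", "derivada", "derivadas", "limite"]))
# ['calculo', 'derivadas', 'limites']
```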

Tags to Remove

Class-specific tags (62 tags total):

clase-02, clase-03, clase-04, ... clase-33
clase-5, clase-6, clase-7, clase-8, clase-9, clase-10

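Overly generic tags (when used as topic tags):

  • clase: keep only as the document type tag in second position; rely on folder structure for everything else
  • concepto: keep only as the document type tag; describe the subject with specific concept names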

Topic-Specific Tag Guidelines

MAT1610 - Cálculo I

Límites

  • limites-laterales - One-sided limits
  • limites-infinitos - Infinite limits
  • limites-notables - Special limits (límites notables)
  • definicion-epsilon-delta - Formal definition
  • asintotas-verticales - Vertical asymptotes
  • asintotas-horizontales - Horizontal asymptotes

Derivadas

  • regla-cadena - Chain rule
  • regla-producto - Product rule
  • regla-cociente - Quotient rule
  • derivacion-implicita - Implicit differentiation
  • derivadas-trigonometricas - Trig derivatives

Aplicaciones de Derivadas

  • optimizacion - Optimization problems
  • razones-relacionadas - Related rates
  • aproximacion-lineal - Linear approximation
  • teorema-valor-medio - Mean value theorem
  • regla-lhopital - L’Hôpital’s rule

Integrales

  • antiderivadas - Antiderivatives
  • integral-definida - Definite integral
  • tfc - Fundamental theorem of calculus
  • teorema-cambio-neto - Net change theorem

MAT1203 - Álgebra Lineal

Vectores

  • vectores-geometricos - Geometric vectors
  • producto-punto - Dot product
  • producto-cruz - Cross product
  • combinaciones-lineales - Linear combinations
  • independencia-lineal - Linear independence

Sistemas Lineales

  • forma-escalonada - Row echelon form
  • eliminacion-gauss - Gaussian elimination
  • consistencia - Consistency
  • solucion-sistemas - Solution sets

Matrices

  • operaciones-matrices - Matrix operations
  • matriz-inversa - Matrix inverse
  • determinantes - Determinants
  • matrices-elementales - Elementary matrices

Espacios Vectoriales

  • subespacios - Subspaces
  • bases - Bases
  • dimension - Dimension
  • espacio-nulo - Null space
  • espacio-columna - Column space

Transformaciones Lineales

  • matriz-transformacion - Transformation matrix
  • valores-propios - Eigenvalues
  • vectores-propios - Eigenvectors
  • diagonalizacion - Diagonalization

EYP1016 - Introducción a la Estadística

Probabilidad

  • espacios-probabilidad - Probability spaces
  • probabilidad-condicional - Conditional probability
  • independencia - Independence
  • teorema-bayes - Bayes’ theorem

Variables Aleatorias

  • variables-discretas - Discrete random variables
  • variables-continuas - Continuous random variables
  • distribucion-binomial - Binomial distribution
  • distribucion-normal - Normal distribution
  • distribucion-poisson - Poisson distribution

Estadística Inferencial

  • muestras-aleatorias - Random samples
  • teorema-central-limite - Central limit theorem
  • estimacion - Estimation
  • intervalos-confianza - Confidence intervals

Implementation Plan

Phase 1: Immediate Actions (Week 1)

  1. Standardize accent usage

    • Remove accents: cálculo → calculo, límites → limites
  2. Consolidate singular/plural

    • Use plural forms consistently
  3. Add course tags

    • Ensure every file has one of: mat1610-calculo, mat1203-algebra, eyp1016-estadistica
  4. Add document type tags

    • Tag all files with: clase, concepto, resumen, ejercicios, or formulas
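
Adding course tags can be partly automated if the course code appears somewhere in each note's path. That is an assumption: this report does not describe the vault's folder layout, and the paths below are hypothetical. `infer_course_tag` is an illustrative helper name.

```python
from pathlib import PurePosixPath

# Course code -> course tag, as proposed in this report.
COURSE_TAG_BY_CODE = {
    "mat1610": "mat1610-calculo",
    "mat1203": "mat1203-algebra",
    "eyp1016": "eyp1016-estadistica",
}

def infer_course_tag(note_path):
    """Return the course tag whose code appears in the path, or None."""
    for part in PurePosixPath(note_path).parts:
        for code, tag in COURSE_TAG_BY_CODE.items():
            if code in part.lower():
                return tag
    return None

# Hypothetical paths, for illustration only.
print(infer_course_tag("content/MAT1610/clases/limites.md"))
print(infer_course_tag("content/misc/nota.md"))
```

Files for which the function returns None correspond to the "Unknown" rows in the distribution tables and would still need manual review.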

Phase 2: Tag Enhancement (Week 2)

  1. Remove class-number tags
    • Delete all class-number tags, both zero-padded (clase-02 ... clase-33) and unpadded (clase-5 ... clase-10)
  2. Add missing topic tags
    • Ensure each file has 3-5 specific topic tags
  3. Standardize tag names
    • Apply consolidation map

Phase 3: Quality Assurance (Week 3)

  1. Verify tag consistency
    • Check all files follow new standards
  2. Update index/navigation
    • Create tag-based navigation if needed
  3. Document tag taxonomy
    • Create Tag_Taxonomy.md reference guide

Automation Recommendations

Python Script for Batch Tag Standardization

Create a script to:

  1. Find and replace accent variations
  2. Consolidate singular/plural
  3. Add missing course/document-type tags based on folder structure
  4. Remove class-number tags
  5. Generate report of changes
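
A sketch of steps 1, 2, 4, and 5 for a single note follows. It assumes tags are inline hashtags on the line directly after a `## Tags` heading, as in the Examples section; all function names are hypothetical, and the plural map covers only the pairs identified in this report. Step 3 is left out because it depends on the vault's folder layout.

```python
import re
import unicodedata

PLURALS = {"derivada": "derivadas", "limite": "limites", "base": "bases",
           "determinante": "determinantes", "logaritmo": "logaritmos"}
CLASS_NUMBER = re.compile(r"clase-\d+")
HASHTAG = re.compile(r"#([^\s#]+)")

def strip_accents(text):
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))

def standardize_tag_line(line):
    """Return (new_line, changes) for one line of '#tag #tag ...'."""
    new_tags, changes = [], []
    for old in HASHTAG.findall(line):
        new = strip_accents(old).lower()       # step 1: accents and case
        new = PLURALS.get(new, new)            # step 2: singular -> plural
        if CLASS_NUMBER.fullmatch(new):        # step 4: class-number tags
            changes.append(f"removed #{old}")
            continue
        if new != old:
            changes.append(f"#{old} -> #{new}")
        if new not in new_tags:                # avoid duplicates after merging
            new_tags.append(new)
    return " ".join(f"#{t}" for t in new_tags), changes

def standardize_note(text):
    """Rewrite the line after each '## Tags' heading; return (text, report)."""
    lines = text.splitlines()
    report = []                                # step 5: collected changes
    for i, line in enumerate(lines[:-1]):
        if line.strip() == "## Tags":
            lines[i + 1], changes = standardize_tag_line(lines[i + 1])
            report.extend(changes)
    trailing = "\n" if text.endswith("\n") else ""
    return "\n".join(lines) + trailing, report

note = "# Derivadas\n\n## Tags\n#cálculo #clase-02 #derivada #derivadas\n"
new_text, report = standardize_note(note)
print(new_text)
print(report)
```

Returning the change list instead of writing files directly makes it easy to run the script in a dry-run mode on a small subset before touching the whole vault.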

Obsidian Plugins (if applicable)

  • Tag Wrangler: For bulk tag renaming
  • Dataview: For tag-based queries and navigation
  • Tag Navigator: For hierarchical tag browsing

Benefits of Standardization

  1. Improved Searchability: Consistent tags make finding related content easier
  2. Better Organization: Hierarchical structure provides clear navigation
  3. Enhanced Connections: Related concepts across courses become visible
  4. Future-Proof: Scalable system for adding more courses/content
  5. Analysis-Ready: Enables data-driven insights on learning patterns

Maintenance Guidelines

Adding New Content

  1. Determine course, document type, and main topics
  2. Apply appropriate tags following the taxonomy
  3. Use existing tags when possible (check Tag_Taxonomy.md)
  4. Create new tags only when necessary and document them

Quarterly Review

  1. Run tag analysis script
  2. Identify new inconsistencies
  3. Update Tag_Taxonomy.md
  4. Consolidate rarely-used tags

Tag Creation Rules

  • Before creating a new tag, check whether a similar tag already exists
  • A new tag should be:
    • Specific and descriptive
    • Consistent with the naming conventions
    • Added to Tag_Taxonomy.md
    • Used in at least 2 files
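
The "similar tag" check can be approximated with fuzzy string matching from the standard library. This is a sketch; `similar_existing_tags` is a hypothetical helper, and the 0.8 cutoff is an arbitrary starting point that may need tuning for this vault.

```python
from difflib import get_close_matches

def similar_existing_tags(candidate, existing, cutoff=0.8):
    """Return up to three existing tags that closely resemble the candidate."""
    return get_close_matches(candidate, existing, n=3, cutoff=cutoff)

existing_tags = ["determinantes", "bases", "limites"]
print(similar_existing_tags("determinante", existing_tags))   # ['determinantes']
print(similar_existing_tags("ortogonalidad", existing_tags))  # []
```

A non-empty result suggests reusing the existing tag instead of creating a near-duplicate such as a singular variant.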

Appendix: Complete Tag List by Category

Course Tags (3)

  • mat1610-calculo
  • mat1203-algebra
  • eyp1016-estadistica

Document Type Tags (5)

  • clase
  • concepto
  • resumen
  • ejercicios
  • formulas

MAT1610 Topic Tags (120 tags)

[Listed alphabetically with definitions in Tag_Taxonomy.md]

MAT1203 Topic Tags (85 tags)

[Listed alphabetically with definitions in Tag_Taxonomy.md]

EYP1016 Topic Tags (35 tags)

[Listed alphabetically with definitions in Tag_Taxonomy.md]

Crosscutting Tags (10)

  • concepto-fundamental
  • teorema
  • formula
  • aplicacion-practica
  • demostracion
  • ejercicio-resuelto
  • metodo
  • algoritmo
  • propiedad
  • definicion

Next Steps

  1. Review this report with stakeholders
  2. Approve tag taxonomy and standardization rules
  3. Create Tag_Taxonomy.md reference document
  4. Develop automation script for batch updates
  5. Test on small subset (10-20 files) first
  6. Apply to entire vault in phases
  7. Monitor and iterate based on usage patterns

Report Generated: 2025-11-24
Script Version: 1.0
Total Analysis Time: ~5 minutes