Week 4 Worklog

Week 4 Objectives:

  • Research and standardize XML data according to Tax Authority regulations to build the project’s business foundation.
  • Develop and test core backend features (XML validation and parsing) using real-world invoice data.
  • Automate file storage processes on AWS S3 using Python Code (Boto3).
  • Stay updated with the latest AI/ML trends and solutions from the AWS re:Invent Recap event to support project development.
  • Optimize system security using AWS Systems Manager and implement invoice anti-spoofing features.

Tasks to be carried out this week:

DayTaskStart DateCompletion DateReference Material
2Web Servers & Project Analysis:
• Deploy a LAMP Stack on EC2
• Draft System Context Diagram
• Research XML Schema Definition (XSD)
26/01/202626/01/2026
3Backend Development (XML):
• Develop XML validation module in C#/.NET
• Refine XSD for diverse invoice formats
• Run unit tests with tuition invoices
27/01/202627/01/2026
4Compliance & S3 Automation:
• Implement risk logic (Decision 1510/QĐ-TCT)
• Automate S3 uploads using Python (boto3)
28/01/202628/01/2026https://000048.awsstudygroup.com/
5Professional Development:
• Attended AWS re:Invent Recap in HCMC
• Explored SageMaker, S3 Tables, and Vector Embeddings
29/01/202629/01/2026
6Security & Bug Fixing:
• Implement anti-spoofing and machine-invoice detection
• Use AWS Systems Manager for secure access
30/01/202630/01/2026

Week 4 Achievements:

  • Successfully deployed web servers on AL2023 and mastered AWS secure system administration tools.
  • Finalized the XML processing module with accurate validation and flexibility based on Tax Authority XSD standards.
  • Integrated invoice risk classification logic into the system, ensuring compliance with current electronic invoice regulations.
  • Mastered S3 storage automation using Python/Boto3 and integrated new AI technology insights into the project roadmap.
  • Completed anti-spoofing and machine-generated invoice detection features, fully resolving technical XML issues.