STA Presentation Website
Started: 04/04/2024
Description: This website is the website showcasing the livestream for people online to watch!
Thank you to Nicolas for amazing banner.



Web Editor Help
Started: 04/04/2024
Description: This project helps staff at COLA to search easily the right documentation with the use of Retrieval Augmented Generation. Currently, they use Google Dialogue when it only returned one matching result and when adding a new documentation, people need to add a lot of phrases to the document in hope that it can match these phrases with user’s search. Moreover, the model or the techniques they use with Google Dialogue is a black box so it is hard to improve the system. Therefore, I come in to help with upgrading the retrieval part of the system with using Cosine Similarity Search when I converted the documents into vector embedding using Sentence Embedding Model. Then, all of the retrieved relevant documentaries are fed into LLM so that it can generate helpful responses.




All of the functions that wrote:
__construct()
calling_hugging_face($text)
convertHelpCorpusAndStoreToDatabase()
storeEmbeddingIntoDatabase(array $embeddingVector, $path, $title)
processEmbedding(Request $request)
return_best_match_pgvector($embeddingData, $k = 5)
show_validation()
showForm()
The results:
Without intent:
[2025-03-25 21:13:03] Validation Top K: 1, Validation Score: 61, Total: 134 -> 45.5%
[2025-03-25 21:11:49] Validation Top K: 2, Validation Score: 82, Total: 134 -> 61.2%
With intent:
[2025-04-04 19:25:30] Validation Top K: 1, Validation Score: 73, Total: 134 -> 54.5%
[2025-04-04 19:26:44] Validation Top K: 2, Validation Score: 94, Total: 134 -> 70.1%
After eliminating not useful validation entry:
[2025-04-04 19:41:12] Validation Top K: 2, Validation Score: 92, Total: 126 -> 73%
[2025-04-04 19:42:55] Validation Top K: 1, Validation Score: 72, Total: 126 -> 57%
[2025-04-04 19:51:38] Validation Top K: 2, Validation Score: 92, Total: 126
[2025-04-04 19:53:53] Validation Top K: 2, Validation Score: 92, Total: 126