Hello... I have about 25,000 Word docs, and I would like to import just the text from each file. After reading other forums, I tried the following, but my result is just a long binary string.
Code:
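CREATE TABLE Documents
(
    DocName    nvarchar(260),   -- wide enough to hold the full file path
    DocContent varbinary(max)   -- raw bytes of the file
);

-- Load the whole file as a single binary value
INSERT INTO Documents (DocName, DocContent)
SELECT N'D:\Brian\CW\Impact_Update_02162022\AllLGLToProcess\40273576.doc', BulkColumn
FROM OPENROWSET(BULK N'D:\Brian\CW\Impact_Update_02162022\AllLGLToProcess\40273576.doc', SINGLE_BLOB) AS blob;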
I tried extracting text from the binary:
Code:
SELECT CONVERT(varchar(max), DocContent) FROM Documents;
But I just end up with this:
ÐÏࡱá
Any idea how I can import the plain text from each of these Word docs into an SQL table?
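A note on that output, for anyone hitting the same thing: those characters look like the start of the 8-byte signature (0xD0CF11E0A1B11AE1) that legacy .doc files begin with. That signature is followed by a run of zero bytes, which is probably why the converted string appears to stop after a few characters. A minimal sketch to check this against the Documents table above, assuming the files are either legacy .doc or .docx (the SizeBytes and Container column names and the labels are illustrative only):

```sql
-- Sketch only: classify each stored file by its leading "magic" bytes.
-- SizeBytes, Container, and the label strings are illustrative names.
SELECT DocName,
       DATALENGTH(DocContent) AS SizeBytes,
       CASE
           WHEN SUBSTRING(DocContent, 1, 8) = 0xD0CF11E0A1B11AE1
               THEN 'OLE2 compound file (legacy .doc)'
           WHEN SUBSTRING(DocContent, 1, 4) = 0x504B0304
               THEN 'ZIP container (e.g. .docx)'
           ELSE 'unrecognized'
       END AS Container
FROM Documents;
```

If the rows come back as OLE2, the text sits inside a binary container, so a plain CONVERT to varchar will not decode it. The text would most likely have to be extracted by something that understands the Word format before it is inserted.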